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FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CAT AC AT T AC T G C AG T AAC AC T C C AC C AT AT AGAC C C G G C T T T AC C T TAT AT C AG T G AC AC T G G 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCAT TTAT GT T CGT TATAAGCAAGT T CATGC T C TGAGTCC TGAAGAGAACG T TATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
AT C AG T T T T G CACAG T GGC AAT T T T G G G AC T GAT T T AG AAC AG AAAC T C CAT T G GAAC C C C GAG G 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
T G GAT T AAC C C T C TAT GAC AC T GCAC C T T GCC C TAT T AAC AAT GAAC GAAC AC GGC T AC T T T CCA 
GAG AT AT T TGA T G AAAG GAT AAAAT AT T T C T G T AAT GAT TAT GAT T C T C AG GGAT T GG GGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT G AAT GC T GAT AAT C AG GAAACAT G AAAGAAGC C AT T T G AT AGAT TAT T C T AAAGG AT AT CAT 
C AAG AAGAC TAT T AAAAAC AC C TAT GCC TAT AC T T T T T TAT C T CAGAAAAT AAAG T CAAAAGAC T 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX (S/T) : 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVR YKQVHALS PEENVT I KLNKAGLVLG I LSCLGLS I VAN FQKT T L F7AAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features : 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains : 

amino acids 54-73, 94-113, 160-180, 122-141 
N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
C T T ACAC T TCG CC ATGA GT T T CC T CAT CGACTCCAGCATCAT GAT TACCT C CCAGATAC TATTTTTTG 
GAT TTGGGTGG CT TT T CTTCATGC GCCAATT G T T T AAAGACT AT GAGAT AC GT CAGT AT GT TGT ACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TT T TCAT GGTG CCTT T TTACATTGGCTAT TTTAT T GT GAGCAATATCCGAC TACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
C T C AG GAAT GT G AC T G ACAC GGAT ATT C TAG C C CT GGAAC G GC G AC T GC T G C AAAC CAT GGAT AT GAT 
CAT AAG C AAAAAG AAAAGG AT GG C AAT G G CAC G G AG AACAAT GTT CCAGAAGGGGGAAG TGCAT AACA 
AAC CAT C AG GT T T C T G GGG AATG AT AAAAAG TGT T AC C AC T T CAG CAT C AG GAAGT GAAAATCT T AC T 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCT ACCAAGGAGAGAATAGAATACT CCAAAACC TTCAAGGGGAAAT ATTTTAATT T T CTT GGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
G GG AAAAC G GATC CT G TCAC AAGAGGCAT T GAGAT CAC TGT GAAT TAT C T G GG AAT C CAAT T T GATG T 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GG CAC CT TGAACT TAAGC CTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GAT ATAAGAGGGG GGAAAAAT GG AACCAG GG C C T GAC AT T T T AT AAACAAAC AAAAT GC T ATGGTAGC 
AT T T T T CAC CTT C AT AGC AT ACT CCTTCCCCGT CAG G T GAT ACT AT G AC CAT GAG T AGC AT CAG C CAG 
AAC AT GAGAGG GAGAACTAAC T CAAGAC AAT AC T CAG C AG AGAG CAT C C C G T G TGGATAT GAGG C TG G 
T GT AG AG G C G GAGAG GAG CC AAG AAAC TAAAG G TGAAAAATACAC TGGAAC TCTGGG GCAAGAC ATGT 
CTATGGT AGCT GAGC CAAAC ACGT AGGAT TT C C GT TT TAAGGTTCACAT GGAAAAGG TTATAGCTTTG 
CCTT GAGAT T GAC T CAT TAAAATC AGAG ACT G T AAC AAAAAAAAAAAAAAAAAAAAAGG G C GGC C GC G 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



App_ID=10063514 



Page 155 of 320 



FIGURE A 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGI IIVTSIRGLLITLTKFFYAISSSKS SNVIVLLLAQIMGMY 
FVS S VLL I RMSMPLE YRT 1 1 TE VLGELQFNFYHRVJ FDVI FL VS AL S S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotio cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE B 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTC ATGCAGAAT TATGGGGAT CACC C T TGTGAGCAAAAAGGCGAACCAGCAGC TGAAT T T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCC GACAGT T T GCAGC C TAT T GT TACAAC TCATC TGAT AC T TGGAC T AACT CGTGCAT TC CAGAA 
AT TAT C AC CAC C AAAG AT C C CAT AT T C AAC AC T C AAAC T G C AAC AC AAAC AACAG AAT T TAT T G T 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
G AAAC T AGC AC C AT GTC T AC AGAAAC T GAAC CAT T T G T T G AAAAT AAAG C AG CAT T C AAG AAT G A 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAAT GATAGCAACCCTAA 
T GAG GAAT CAAAGAAAAC T GAT AAAAAC CCAG AAG AG T CC AAGAG T C CAAG CAAAAC TAC C G T GC 
GATGCC TGGAAGCT GAAGT T TAGAT GAGACAGAAAT GAGGAGACACACC T GAGGC TGG TT TC T TT 
CAT GC TCC T T AC CC TGC CC CAGCTGGGGAAAT CAAAAGGGC CAAAGAAC CAAAGAAGAAAGT CCA 
C CC T TGGT T CC T AAC TGGAAT CAGCT CAGGAC TGCCAT TGGAC TAT GGAG T G CACC AAAGAG AAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GC T C T GAAAGAGAAAC ACG T AT CCC AC C T GAC ATGT CC T T C TGAGCC C GG T AAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCT AAAAT AAAGAAATAGAACAAGGCT GAGGATACGACAGTAC AC T GT C AGCAGGGAC TGTAAAC 
ACAGACAGGG T CAAAGTGT T T TCTCT GAACACAT TGAGT T GGAATCAC T G T T TAGAACACACAC A 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC T C T T AT AAAT T T CT AT T T T TAT C TGAGT T AC AGAAAT GAT TAC T AAGG AAG AT T 
AC T CAG T AAT T T G T T TAAAAAG T AAT AAAAT T C AAC AAAC AT T T G C T GAAT AGC TAC TAT AT GTC 
AAG T G C T G T G CAAG G TAT TAC AC T C T G T AAT T GAAT AT TAT TC C T C AAAAAAT T GC AC AT AG TAG 
AAC GC TAT C T GGGAAGC TAT T T T T T TCAGTT T TGAT AT T T C TAGC T TAT C TACT T C CAAACT AAT 
T T T TAT T T T T GC TG AG AC T AAT C T TAT T CAT T T T C T C T AAT AT G GC AAC CAT TAT AAC C T T AAT T 
TAT TAT T AAC AT AC C T AAG AAG TAC AT T G T T ACC T C TAT AT AC C AAAG CAC AT T T T AAAAG T G C C 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
T G T GAC AAAAAAT T AAAG CAT T TAG AAAAC T T 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S L AGKDQVE T ALKAS FE T C S YGWVG DGFWI S R I S PNPKCGKNG VGVL I WKVP VS RQ FAAYC YN 
SSDTWTNSCIPEI ITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTEVFMET S TMS TE TE P FVENKAAFKNEAAG FGGVPTALLVLALLF FGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

ammo acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 

CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT T GC GCAGCGCGG T GGAAGAGATGGAGGCAGAAGAAGC TGC T GC T A 
AAG C AT CAT C AGAAG T G AACC T G G C AAAC T T AC C T C C C AGC TAT C AC AAT GAGAC C AAC AC AG AC 
ACGAAGGT TGGAAATAATACCAT CCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
T G G AC AAAT GGTCTTTT C AG AG AC AG T TAT C AC AT C T G T G G GAGAC G AAG AAGG C AG AAG G AGC C 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

G AGAT CC T GC T GC C CAG AGAGG T CC CC GAT GAG TAT G AAGT T G GC AGC T T CAT G GAGGAGG T G C G 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
T T T G T T CTACATGGC T T T GAT AAT T G T T T GAGGGGAGGAG AT GGAAACAAT G T GGAGT CTCC CT C 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAMCATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
C TAC T GT T AGGAACAGC AG TG T T C T CACAG TGT GGGGC AGCCG TCCTT C TAAT GAAGACAAT GAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC 
AAAAT C ACT TAG CAG C AAC T GAAGACAAT TATCAACCACGTGGAGAAAATCAAACCGAGCAG GG C 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T G TAT AAG CATGC T T TC T T T GAGT T T TAAAT TAT G TAT AAAC AT AAGT TG CAT T TAG AA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATL1SFEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEVNLANLPPS.YHNETMTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGS FMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence : 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site, 
amino acids 140-144 
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FIGURE Q 

CGGA C G C G TGG GCG GAC GC GT GGG GG C T GT GAGAAAG T G C C AAT AAAT ACAT CAT G CAACCC CAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACC T GGAGC GGAACAACGGC T CCCTGGAC CGGCCC TAG TACAT GTC CAAGAGC C T TCTAA 
AGAT TC T G GG CAAG AAGAACGAG GC G C C CC CG GACAAC AAGAAGAGG AAGAAG TGACAG C T C CG G 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
T T G AGAGG C T GAGG C G G GC GGAT CACC T GAG T GAG GAGT T C GAGACC AGCC T GG CC AACAT G G T G 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT C GC GC CAC T G CAC T C C AAC C T G G GT GAC AG AC TCTGTCTC CAAAAC AAAAC AAAC AAAC AAA 
AAG AT T T TAT T AAAG AT AT T T T G T T AAC T C 
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FIGURE io 

RTRGRTRGGCEKVP I NTSCNPTAHLVNS S C PGLMCVFQGYS SKGL I QRSVFNLQI YGVLGLFWT L 
NWVLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNPJ^AYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 2 7 6-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AAC AG C C TG AT GCAG AC C G AGGAG T GC C C AC C TAT G C TAGAC G CC AC AC AG CAGC AG GAG CAG C AG G T 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
C CT CTAC CT GT GGAC CCTGGTAGC CCCACTCC T CCT GCGCAACCG CGAC TT CAGC TGAG G CAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GAC T TCGTGCC TTACT GAGT CTCT AAGAC TTTT TCTAATAAACAAGCCAGT GCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPC I LCSCCPASRNSTVSRLI FTFFLFLGVLVS I IMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVl^SLHWMTLTNWYKPGETRKMISTWTAVWWICASWAGLLLYLWTLVAPLLLRNRD 

FS 

Signal sequence : 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
G GT T GG AAAAAG AC T C C T G T AAC C C T C C TCCAGG ATGA AC C AC C T GC CAG AAGACAT G GAG AAC G 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
C T CAT G GC CAG GAT T GAG T CC TAT G AAG G AAG G G AAAAG AAAG G CAT AT C T GAT G T CAG GAG G AC 
T T T C TG T TT G T T T G TC ACCT T TGACC T C T TAT TCG T AACAT TAC T GT GGAT AATAGAGT T AAAT G 
T G AAT G GAGG C AT T G AG AACACAT T AG AGAAGGAG GT GAT GCAGTAT GACT ACTAT TCTT CATAT 
T T T GATATAT T T C TTCTGGCAGT T T T T CGAT T TAAAG T G T TAATAC T TGCATAT GC T G TGT GCAG 
AC T G C G CCAT T G G T G G G C AAT AG C G T T G AC AAC G G CAG T G ACC AG TGCCTTTT TAC TAG C AAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAG AC T CC T GAT AG T T CAG GAT G C T T C AGAGAGG G CAG CAC T TAT AC CTGGTGGTCTTTCT GAT G 
G T CAG T T T TAT TCCCCTCCT GAAT C C GAAGCAGGAT C T GAAGAAG C T G AAGAAAAACAG G AC AG T 
G AG AAAC CAC T T T TAG AAC T ATGAG TAC TAC T T T T G T T AAAT G T GAAAAAC CC T CAC AG AAAG T C 
ATCGAGGCAAAAAGAGGCAGGCAGT GGAGTC TCC C T G T CGACAGT AAAGT T GAAAT GGT GAC G T C 
CAC TGCTGGCTT TAT T G AAC AGC T AAT AAAG AT T TAT T TAT T G T AAT AC C T CAC AAAC G T T G TAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T T AAT AT AT T GAAG T AAC AC T T TTT TAG T AAGC AAGAT AC C T T T T TAT T TCAATT CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
T CCCT TGT T T T T TAAC TCAT GCACATGTGC TC TT TGTACAG T T T TAAAAAGT GT AATAAAAT C T G 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T G C AAAAG G G GAAG AAAG GAAT T G C GAAT AC AT G T AAAAT G T CAC CAG AC AT T T G TAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
C T AAAT T AAAAA 
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FIGURE 14. 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI I ELNVNGG I ENTLEKEVMQYDY YS S YFD I FLLAVFR FKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids" 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



AppID-10063514 



Page 166 of 320 



u..J> u.„r ...s., - & '^u'a ...ti., n,,.}' » 



FIGURE is 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
G G C C T G T AC T G T GAG AG C C AG AT G G G G C AG G G G AC AC G G C C C AGC C C T AC AC C AG T C AC G C C GAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
AC T CAT GG GC T T CCCAGGGCC T GGCC T CCAGT CACC CCT C CAC GC AAAGCC C T ACAT C TAAG C C A 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T TAG G AAC A TGTTTTGCTTTTT T AAAAT AT AT AT AT T T AT AAG AGAT C C T T TC CC AT T TAT T C T 
G G G AAG AT G T T T T T CAAAC T C AG AG AC AAGGAC TTTGGTTTTTG T AAG AC AAAC GAT GAT AT G AA 
GGCCTTTTG T AAG AAAAAAT AAAAG AT G AAG T G T G AAA 



AppJCD=10063514 



Page 167 of 320 



FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNR I RH I QPGAFDT LDRLLELKLQDNELRALP PLRLPRLLLLDLSHNS LLALE PG I LDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
T RCH FP PKNAGRLL LE L DYAD FGC PAT T T TAT VP T T R PWRE P T AL S S S LAP T WL S P TAP AT EAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAAIJWVVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-2 3 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGATTCCAAGACTACTT TGACAT CAGATGAGTCAGT A 
AAG GAC CAT AC T AC T G C AG GC AG AG TAG TTGCTGGT C AAAT AT T T C T T GAT T C AG AAG AAT C T G A 
AT TAGAATCC TCTATT CAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAG AT AT C AGC T T T C T AGAGT C T C C AAATC C AGAAAAC AAGGAC TAT GAAGAGCCAAAGAAAG T A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T CC T AGAT AAG GAG TAT GATGAAT G T AC AT CAG AT G G GAGG GAAG AT G G CAG AC TGTGGTGTG 
C T AC AAC C TAT GAC T AC AAAG C AGAT G AAAAG TGGGGCTTTTGT G AAAC T G AAG AAG AGG C T GC T 
AAGAGAC GGC AG AT GC AGG AAGC AG AAAT GAT GT AT C AAAC T GGAAT GAAAAT C C T T AAT G GAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATAT CGGTATCT CCAAAAGGCAGCAAGCATGAACC AT A 
C CAAAG CC C T G GAG AG AG T G T CAT AT G C T C T T T TAT T T G G T GAT T AC T T G C GAC AG AAT AT C CAG 
GC AG C G AGAG AG AT G T T T GAG AAG C T GAC T GAGG AAG G C T C T C CC AAGGG AC AG AC TGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AAT AAT AT T AAC AT CAG AAG AAT TTGTGGTT T AT AGCG G C C AC AAC T T T T T CAG C T T T C ATGAT C 
CAG AT T T GC T T GT AT T AAGACCAAATAT T CAG T TGAAC T T CC T T C AAAT T C T T G T T AATGG AT AT 
AACACATGGAATCT ACATGTAAATGAAAGT TGGTGGAGTCCACAATTT TTCT T TAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
G AAT T G T CAG AAT C AT T T T T T AC AT TAG AT TAT CAT AAT T T T AAAAAT T T T T C T T TAG T T T T T C A 
AAAT T T T G T AAAT G G T G GC T AT AGAAAAAC AACAT GAAAT AT TAT AC AAT AT T T T G CAAC AAT G C 
C C T AAG AAT T G T T AAAAT T CAT G GAG T TAT T T G T G CAG AAT GAC T CC AG AG AGC T C T AC T T T C T G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAG AT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 

S SQAKALVY YT FGALGGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal . 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 



AppJD=l 00635 14 



Page 170 of 320 



.„ii„ u.Ji s^i* «,i:„ »,i 'i.J' "L,}i ,„u„ 



FIGURE 1Q 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 

AAC C T C AG AGAG AC T T C GT AC TGTGCTTCT GG AT G T GAC C G AC C C AGAG AAT G T C AAGAG GAC T G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCAT T GAACC AG GAT T G T T CAAAAC AAACT TGGCAGAT CCAGT AAAGG T AAT T GAAAAAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 

AG T C T AG ACAAAC T G AAAGGC AAT AAAT C C TAT G T G AAC AT G GAC CTCTCTCCGG T GG T AG AG T G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CAT C T C T TAT C T AAAT AT T AAAAG AT AAG T C AAC C C AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

ML FWVLGLL I LCG FLWTRKGKLK I EDI TDKY I F I TGCDSG FGNLAART FDKKG FHVI AAC LTES G 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

ammo acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site . 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
G G T G AAG C C AAG AACAGCAT T AC T GAT T CC C AAAT G GAT GAT G T T GAAG T T G T T T AT AC AAT T G A 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AG C AAG C AC T G AAG AAAAT AT TAT C AAAT G T C AAAAAG AAT G T GG TAG G T T G G T AC AAAT T C C G T 
CGTCAT TCAGATCAGATCATGACGTT TAGAGAGAGGCTGCTTCACAAAAACT TGCAGGAGCATTT 
T T CAAACC AAGACCT T GT T T T TCT G CT AT TAACACC AAG T AT AATAACAGAAAGC T GC TC TAC T C 
AT CGAC TG GAAC AT T C C T TAT AT AAAC C TCAAAAAGG ACT T T T TCACAGGGTAC C T T TAG T GG T T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T T TAG C C GAG CAGT AC AAAC AC AC AG C TC T AAAT t T T T T GAAGAAG AT G GAT C C T T AAAG G AG G 
T AC AT AAG AT AAAT G AAAT GT AT G C T T CAT T ACAAG AG GAAT T AAAG AG T ATAT GC AAAAAAG T G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
G AAAAG GAGAG G AGC AC AG AT T C AGGC AGC AAGAG AG AAG AAC AT CC AAAAAGAC C C T C AG GAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AG AC AAT C T G AC C T T AAT G GT AG AACAC AC TG AC AT T C C T GAAGC TAG T CC AGC T AGT AC AC CAC 
AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 
GAT ACACAAGAC AAAC GAT C T AAAG CAAAT AC T G G TAG TAG T AAC CAAG AT AAAGC AT C C AAAAT 
GAGCAGCC CAGAAACAGATGAAGAAAT TGAAAAGAT GAAGGGT T T TGG T GAATAT TCACGG T CT C 
CT AC AT T T TGAT C C T T T T AAC C T T ACAAGGAG AT TTTTTTATTTGGCT GAT G G G T AAAG C CAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T G T T TG CAGTAAT ACAC AGATAAC TCT TAG TGC AT T TAC T T CACAAAG TAC T T T T TCAAACAT CA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
AC AC AT T C T T TAG AAT T GGAAAAG T GAG AC C AGG C ACAG T GGC T CAC AC C T G T AAT C C CAG CAC T 
T AGGGAAGACAAGT CAGGAGGAT T GAT T GAAGC TAGGAGT TAGAGACCAGCCTGGGCAACG T ATT 
GAG AC CAT G T C T AT TAAAAAAT AAAAT G G AAAAG CAAG AAT AG C C T TAT T T T CAAAAT AT G G AAA 
G AAAT T T ATAT G AAAAT T T ATC T GAG T CAT T AAAAT TC TC C T T AAGT GAT AC T T T T T TAG AAG T A 
CAT TAT GGC TAG AG T T GC CAG AT AAAAT GC T GGAT AT CAT G C AAT AAAT T T G C AAAAC AT C ATC T 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFS FYNS S GE VNEQALKK I L SNVKKNWGW YKFRRHS DQ IMT FRERLLHKNLQEH FS NQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEAS PASTPQI IKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation si tes . 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACC ATG GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
C AAGAGATGGAGGAGAAGT ACAGCCT GCAAAGT AAAGAT GATT T CAAAAGT GTGGTCAGC GAACAGT GC AAT CATT T G 
CAAGCTGTCTTT GCTT CACGTTACAAGAAGTTT GATGAATT CT T CAAAGAACT ACTT GAAAAT GCAGAGAAATCC CTG 
AAT GATATGT TT GTGAAGACAT AT GGCCATT TAT AC ATGCAAAATT CT GAGCTATTT AAAGAT CT CTT C GT AGAGTTG 
AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
T T CCGCCT G GT GAACT CC CAGT ACCACTT T ACAGATGAGTAT CT GGAATGT GTGAG CAAGT ATACGGAG CAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
T GTTT GGCCAACCAAG GGGATCT CGAT TT T GAATG GAACAATT T CAT AGAT G CT ATG CT GAT GGT GG CAGAGAGGCTA 
GAGGGTC CT T T C AAC ATT GAAT CGGT CAT G GAT CCCATCGATGT GAAG AT T T CT GAT G C TAT TAT GAAC AT GCAG GAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTT GGACCGACTGGTTACTGATGT CAAGGAGAAACTGAAACAGGCCAAGAAATTCT GGT CCTCCCTTCCGAGCAAC 
GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTTTGCAGT GACAGGAAATGGATT AGC CAAC CAGGGCAACAACC CAGAGGTC CAGGTT GACACCAGCAAACCAGAC 
ATACT GAT C CTT CGT C AAAT CAT GGCTCTTCGAGTGATGACCAGCAAGAT GAAGAAT GCATACAATGGGAACGACGTG 
GACTT CTT T GAT AT CAGTGATGAAAGTAGTGGAGAAGGAAGTG GAAGT GGCTGT GAGTATCAG CAGT GCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCTCCTCA CTGTCTTCTGCATCTTGTTCCTGGTTAT GCAGAGAGAGT GGAG ATAAT T CTCAAACTCT GAG 
AAAAAGTGTTC^TC7UUy\AGTTAAAAGGCAC(^GTTATC^^ 

TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCTGAAATATTAAATAGCTGTACAGAAGCAGGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24. 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVSVVNPTAQCTHALLKMI YCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KS ANE KAD SAGVR P GAQAYLLTV FC I L FLVMQRE WR 

Important features : 
Signal peptide: 

amino acids 1-22 

AT P/GTP -binding site motif A (P-loop) . 

amino acids 515-524 

N-gly cosy 1 a tion s i te . 

amino acids 514-518 

Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins . 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
T AC AGC TG CACC GACAG T T GCGATGAAAGT T C TAAT CT C T TCCC TCC T CC TG T TGCT GCC AC TAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAAT GTGAAGAAAACAAGACACCAAAGGC AC CACAGAAAGCCAAACAAGCAT T CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAGG AGCTCTG 
AGCGCCCACTCTTCCAAT TAAACAT TC TCAGCCAAGAAGACAGTGAGCACACCTACCAGACACT C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTG TCAT T T AACC TTAAAT GCAAT C AGGAAAG T AGCAAAC AGAAG T CAAT AAATAT T T T T 
AAAT GT CAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide : 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



G GACGC CAGC GC C T G C AGAGG C T GAG C AG G GAAAAAG C C AG T G C C C C AG C G GAAGC AC AG C T C AG 
AGCTGGTCTGCC ATGG ACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCT GAC T C C C AAGAG C AACCG CAAG AT G GAG AG C AAGAAACGG GAG C T C T T CAG C CAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAAC T T TCAGT T C T AC C CACCGGGC T GCAGGG TCACCT GC C TAGACC CAAATCCCCAC T T T GAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCC AC C T G GAAAC AC AT TGG GGAT GGCTGCTGCCT CAC CAGAGAG AC C T GG AAGGAT C T T G A 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T T C C C CAG C C T C C AAT T AG AAC AAG C C AC C CAC CAG C C TAT C TAT C T T C CAC T GAG AG G GAC C T A 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AAT C T C TAAC T T CAAT C CC GCC T TC GACAGT GAAAAAGC T C TAC TT C TACGCT GACCC AGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACT CAAAGAGGCAGAGGT T T T GT T C TCAAATAT T T T T T AAT AAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide : 

amino acids 1-23 

Leucine zipper pattern, 

amino acids 10-32 

N-myristoylafcior 1 sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATG CTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAG C T GG GAG AT AGGGAACAG AAGAGGG T AG TGGGTGGGC TAG GGGGGCTGCCT TAT T T AAA 
GTGGTTGTT TAT GATTCT TAT AC T AAT T T ATACAAAGAT AT T AAG GC CC T G T T CAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AG T T AAAAGC T G AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE ai 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCT^AGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATC ATG TCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTT T ATGG T G GC T GT AT TAT GAC T AT ACC AAC GACC T C AGCAT AGAAT T G GAC A 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAAT CATT GTCATGTACAT GCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC CAGAAT GCAT ATAC TACAAC T GC T AT TAATGGG AC AGAT T T CT GTACAT CAGCAAAAGATG C 
ATT CAAAAT C T T G T CCAAGAAC T CAAGTCAC T T T AC AT C T AT TAAC T GC T T T GGAGACT T CATAA 
T T T T T C TAGGAAAGGT GT TAG TGG T G T GT T TCAC TG T T T T TGGAGGACT CAT GGC T T TTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGG AG C AAC AAAT T AAACAAT G C AAG G G C AC AG C AG G ACAAGCAC T CAT T AAG GAAT GAG G A 
G G G AAC AGAAC T C C AG G C CAT T G T G AG ATAGA T AC C CAT T TAG G TAT C T G T AC C T G G AAAAC AT T 
T CC T TC TAAGAGCC AT TTACAGAAT AGAAGATGAGACCAC TAGAGAAAAG T TAGT GAAT t T T T T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFT FRFITTLLVHI FISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHP ILS SLS I LFFYHQGT WKGS FL I S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE aa 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 

ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 

GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTT ATGA GGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

AT T CAAAC AAAG AAAC G GC AAAG AAG AT T AAAAG G C C C AAG T T CAC T G T G C C T C AG AT CAAC T G C 

GAT G T C AAAG C C G G AAAGAT CAT C GAT C C T GAG T T CAT T G T GAAAT G T C C AG C AGG AT G C C AAG A 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 

GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 

TAT C G T C T TAG AAAG T AAAC CCAAAAAG GG T G T AAC C T AC C CAT C AG C T C T T ACAT AC T CAT CAT 

CGAAAAGTCCAGC T GC C CAAGCAGG T GAGACCAC AAAAGC C TAT CAGAGGC CAC CTAT TCCAG G G 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 

AGAG C T GAT CCAG G TAT CC AAAG GC AAGATC C T T CAG GAG C T GC C T T CC AGAAACC T G T T G GAG C 

G GAT G T CAGCC T G GGAC T T G T T C CAAAAGAAG AAT T GAG C AC ACAG T C T T T G GAGC CAG TAT C C C 

TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 

AC ACGAAT T C T C GAGAT C T GAAG AC AG CCAT AGAGAAAAT T AC T C AGAGAG GAG GAC T T T C T AAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GAC T T G C GAG AG AG T C AG G AAT CAAC AT T T T C T T CATC AC CAT T G AAGG TG C T G C T GAAAAT GAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAG T T T GAGAT T TCC GACACGGACAC GCGCAT C GGGG CC G TGCAGT ACAC C TACGAACAGC GGC 

TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 

GT T TGACAACC T CC AT CAG TATGTC CCCAGGAT CAT CC AGAAC AT TT GT AC AGAGT TCAAC T CAC 

AGC CT CGGAAC TGAA T T CAGAGC AGG C AGAG CAC CAGCAAG T GCTGC T T TACTAAC TGAC G T G T T 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

T GAT CACAAAC G T AT AGAAT GAG C C AAAAG G C T ACAT CAT G T T GAGG G T G C T GG AG AT T T T ACAT 

TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T T T T G T CAT GAC AAT G TAG GAAT T G C T G AAT T AAAT G T T T AGAAG GAT G AAAAAT AAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWL TMKASV I EM FLVLLVTGVHSNKE TAKK I KRPKFT VPQ INCDVKAGK 1 1 DPE FI VKC PAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S F I VL E S K P KKG VT Y P SAL T Y S S S KS PAAQAG E T T KAY QRP P I P GT T AQ P VT LMQ L L AVT VAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RS GAP1WVVVMVDGW PT DKVEEAS RLARE SGINIFFITIE GAAENE KQYWE PNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG 

CC CTTC AAAG ATGAT ATG T GGAG GAAT T AAAT AT AAAG GAAT TG GAGGT T TTT GCT AAAGAAATT AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT TATTT AAAATTACT TCAACT TTGTGTT TTT AAAT GT TT T G ACG AT T T C AAT AC AAG AT AAAAAGG AT AG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AG AG AAG AAT T AAAGC AAGAAAATCTG AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
T LREHSNCS HQNP FLVI LVT SHPS DVKARQAI RVT WGEKKS WWGYEVLT FFLLGQE AEKE DKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG I CLNLLKVN I H I PE DTNL FFL YR I HLDVCQLRRVI AAHG FS S KE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

ammo acids 23-37 

N-myristoylation site. 

amino acids 271-277 



\ 
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FIGURE 3Z 

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGT C CTTGCC AAGAGAGTAC AC AGTCAT TAATGAAGCC TGCC C T GGAGCAGAGTGGAAT ATCATGTG TCGGGAG TGCTGTG 
AATATGATCAGAT TGAGTGC GTC T GCCCC GGAAAGAGGGAAG TCG TGGGT T AT AC C ATCC C T TGCTGCAGGAATGAG GAGAA 
TGAGTGT GAC T C C TGC C TGATCC AC CC AGGT TGTAC C ATC T TT GAAAAC TGC AAGAGCTGC CG AAATGGCT C ATGGG GGGGT 
ACC TTGGAT GAC TTC T ATGT GAAGGGGTTC TACTGTGCAGAGT GC CGAGC AGGCT GGTAC GGAGGAGAC TGCATGC GATGTG 
GCCAGGT TC TGC GAGC CCC AAAGGGTCAGATTTTGTTGGAAAGC TATC CCC TAAATGCTC AC TGTGAATGGACCATTCATGC 
TAAACCTGGGT T T GTC ATCC AAC TAAGAT TTGTCATG TT GAGTC T GGAGT T T GAC TACATGTGCCAGTATGAC TAT GT TGAG 
GTT C GTGAT GGAGAC AACCGC GATGGCCAGATCAT CAAGCGTGT C TGTGGC AACGAGCGGCCAGCTCC TAT CCAGAGCAT AG 
GAT C C TCAC TCC ACGTCC TC T TC C ACTCCGATGGC TCCAAGAAT T T TGACGG T T TCC ATGC CAT TTATGAG GAGAT CAC AGC 
ATG C T CC TC ATC C CC T TGT T TCC AT GACGGCACGTGCGTC C TTGACAAGGC T GGATC T T AC AAGTGTGCCT GC TTG G C AGGC 
TATAC TG GGC AGC GC TGTGAAAATC TCC T TGAAGAAAGAAAC TGC TC AGAC C C TGGGGGC C CAG TCAATGGGTAC CAGAAAA 
TAACAGGGGGCCCTGGGC T T ATC AACGGACGCC ATGC TAAAAT TGGC ACCGTGGTGTCTTTCTTTTGTAACAAC TC CTATGT 
TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAAGCCTGCCGA 
GAACCAAAGAT T T CAG ACC T GGTGAGAAGGAGAGT TC TT CC GAT GCAGGTT C AGTCAAGGGAGAC ACCATTACAC CAGC T AT 
ACTCAGCGGCCTTC^GCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATAC CAACATC TGC AT ACCC AGC TCC AGT ATGAGT GC AT C TC AC C C T TC T AC CGCCGCC T GGGC AGC AGCAGGAG GAC AT GT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
CCCAAGGGT TGC GC TGGC C GTGGC AGGCAGCCATC T ACAGGAGGACC AGC GGGGT GC ATGACGGC AGC C TAG AC AAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCAC CATGAT C AAGAC AGC AGACC TGAAAGT TGTT T TGGGGAAAT TC TACCGGGATGAT GACCGGGATGAGAAGAC CAT CC 
AGAGC CTACAGATTTC TGC TATC AT TC TGCATCCCAACT AT GAC C CCATCC TGCT TGATGC TGACATCGCCAT CC T GAAGC T 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCC C ACATCAC TG TGGC TGGC TGG AATGTCCTGGCAGAC GTGAGGAGC CCT GGC T TC AAGAACGACAC ACT GCGC TC TGGGG 
TGGTC AGTG TGGTGGAC TC GC TGC T GTGT GAGGAGC AGC AT GAGGAC C ATGGC AT CCCAGTGAGTGTCACTGAT AAC ATGT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAG CAT CTCC T GAGC C AC GC TGGC ATC T GATGGGAC TGGT CAGCTGGAGC T ATGATAAAACATGCAGCCACAGG C T CTC C A 
CTGCCTT CACC AAGGT GCTGC C T T T TAAAGAC TGGAT TGAAAGAAAT AT GAA ATGAA CC ATGC TC ATGCAC TCCTT GAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
CTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCA 
CTCCACTGACC TGGTGGTC T TCCCCAACT TTCAGTTATACGAATGCC ATCAGC TT GACC AGGGAAGATC TGGGCTTC ATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063514 



Page 189 of 320 



FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 

IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ, 

VLP^PKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVKDGDNRDGQII 

KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 

ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 

KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 

KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 

KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 

KWLGKFYRDDDRDEKT IQSLQI SAI I LHPNYDP I LLDADIAI LKLLDKARI STRVQP ICLAASR 

DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 

SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 

ERNMK 

Important features of* the protein: 
Signal peptide: 

amino acids 1-2 3 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

ammo acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAG AATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTGAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
AC CCT G CCT ACGT GT C CT C G G CAGAG GACGGGCAG CCAGCAAT CAGC CCAGTG GACT CT GGCCG GAG CAAC CGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATT CTGAAAACAC CACT GCCC CTGAAG 
T CTTT CCAAGGT T GT ACCACCT GATT CCAGATGGT GAAATT AC CAGCAT CAAGAT CAAT CGAGT AGAT C CCAGTGAAA 
GCCTCTCTATTAGGCT GGT GGGAGGTAGCGAAACCCCACTGGT CCATATCATTATCCAACACATTTAT CGTGATGGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAG GAACAAT GGACAGG C CC C GGAT G C CTACAGAC CCCGAGAT GACAGCT TT CATGT GATT CT CAACAAAAG TAG CC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GT GG CAT AT C GAC AT GGT CAGCTT GAGGAGAAT GACCGT GT GT TAGCC AT CAATG GACAT GAT CTT C GAT ATG GCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAG CCCT GACAT CT T T CAG GAAGCC GG CT GGAACAGCAAT GGCAGCTGGT CC CCAG GG C CAGGGGAGAGGAGCAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 
TCATAAGCAGAGAT GGAAGAAT AAAAACAGGTGACATTTT GTT GAAT GT GGATGGGGT CGAACT GACAGAGGTCAGCC 
GGAGTGAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGC 
CC CAGGAA GACT GCA GCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 
TGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCTTTTTTCATCAAATCCATTGTTGAAGGAA 
CACCAGCAT ACAAT GATGGAAGAATT AGAT GT GGT GAT ATT CTTCTTGCT GTCAAT GGTAGAAGT ACAT CAG GAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTTT ATAG AATCAAT GATGGGT C AGAGGAAAACAGAAAAAT CACAAAT AG G CTAAGAAGTT GAAACACT AT AT TT ATC 
TT GTCAGT T T TT ATATTTAAAGAAAGAAT ACATTGT AAAAATGT CAGGAAAAGTAT GAT CAT CT AAT GAAAG CCAGTT 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACT ACT AGTTTTTTTT CAGT GTGGAGGATTT CTCATTACTCTAC 
AACATTGT TTAT ATT TTTT CT AT T C AAT AAAAAGC C CTAAAAC AACT AAAATGAT T GAT TT GTAT AC C CCACTGAATT 
CAAGCTGAT T TAAATTTAAAATT T GGT AT ATGCT GAAGT CT GC CAAGGGT ACAT TAT G G C CATT T T TAAT TT ACAG CT 
AAAAT ATT T T TT AAAATGC ATT GCT GAGAAAC GTT GCTTT CAT C AAA CAA GAAT AAAT ATT TTT CAGAAGT T AAA 
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FIGURE go 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
AL5VLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHII IQHI YRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRS RNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG IKL VRKVDE PGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTO\GTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCA^CTTGAAGCTTT 
CTTGCCT GCAG TGAAGCAGAGAGATAGAT AT TAT TCACGTAATAAAAAACATGGGCTTCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGT AACTACTTC GTGGGTGC CAT TC AAGAGAT T CCTAAAGCAAAGGAGT TCATGGCTAAT T TCC 
ATAAG AC CCTC ATTT T GGGGAAG G GAAAAAC T C T GAC T AAT GAAG CAT C C AC G AAGAAGGT AGAACT T 
GACAACT GT CCTTCTGTGTC TCCT TACCT CAGAGGCCAGAGCAAGCT CATTT TCAAACCAGATC T CAC 
T T T GG AAGAGG T ACAG GCAGAAAAT CC C AAAG T GT CC AGAG GCCG GT AT CG C C CT CAGGAAT GT AAAG 
CTT TACAGAGGGT CGC CATC CTCGTTCCC CAC CGGAACAGAGAGAAACACCT GAT GTAC C T GCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
T AAAAAG T T TAAT CGAGCC AAAC T C TT G AAT GT GGGC TAT C T AGAAG C C C T C AAG GAAG AAAAT T GGG 
AC T GC T T TAT AT T CC AC GAT GT G G ACC T G GT AC C C GAGAAT GAC T T T AAC CTT TAC AAG T GTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCT GAAGT GGGT AAAT ATAC AATGGTCTT CCAC ACT AGAG AC AAAG GC AAT GAGGT GAAC GCAGAACG 
GAT GAAG CT CT T ACAC C AAG T GT C ACG AG TC T GGAGAACAG AT GG GT T G AGT AGT T G TT C T TAT AAAT 
T AGT AT C T GTG GAAC ACAAT CCT T TAT AT AT C AAC AT CACAGT GGAT TTCTGGTTTG GT GC ATGACC C 
TGGAT CT T TTGGT GAT GTTT GGAAGAACT GAT TCTTT GT T TGCAATAATTT T GGC CTAGAGACT T CAA 
AT AGTAG C ACAC AT T AAGAAC CT G T TAC AGC T CAT T G T T G AGCTGAAT TTTTCCTTTTT GT AT T T T C T 
TAGCAGAGCTCCT GGT GATGTAGAGTAT AAAACAGTT G T AAC AAG AC AG C T T T CT TAGT CATTT T GAT 
C ATGAGGGT TAAATAT TGTAATAT GGAT ACT TGAAGGACT TTATAT AAAAGGATGACTC AAAGGATAA 
AATGAACGCTATT TGAGGACT CTGGTT GAAG GAGATT TAT TT AAAT T T GAAG TAAT AT AT TAT G GGAT 
AAAAGGC CACAGGAAATAAGACTGCTGAATGTCT GAGAGAACCAGAGT TGT T CTCGT CCAAGGT AGAA 
AGGTACGAAGAT ACAAT ACT GTTAT TCAT TTATCCTGTACAAT CATCT GTGAAGT GGTGGT GTCAGGT 
GAG AAGG CG TC CACAAAAG AGGG GAGAAAAG G C GAC GAAT C AGG ACACAG T GAAC T T GG G AAT GAAG A 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTT CAGG GGAGGACC T GCCCAGGT ATGCCTTCC AGT GATGCCCAC C AGAGAATACAT TCTCTAT TAGT 
T T T T AAAG AGT T TTTGTAAAATGAT TT TGTACAAGTAGGATAT GAAT T AGC AG TT TACAAGTT T ACAT 
AT T AAC T AATAAT AAAT AT G T CTAT CAAATAC C T C T G TAG T AAAAT G T G AAAAAG C AAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKHmAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites; 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGC^CG ATG GCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAAC T T GCAGAGCT GCAAC CC CAGGACAGAGC TGGAGCCAGGGCCAGC TGGATGCC CAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATGG CTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
G AC C T T CAAC ACAAC CCCTCTTGT C AC CAT AC AG CC AG AAG GG G G CAC TAT CAT AG T G AC CC AAA 
ATCG T AAT AG G GAG AG AG TAG AC T T C C CAG AT GG AG GC TAC TC CC T GAAGC TCAGC AAAC TGAAG 
AAGAAT GAC TCAGGGAT CTAC TATGT GGGGATATACAGCT CAT CAC TCCAGCAGCCCT CCAC CCA 
GGAG TACG T GCT GCAT G TC TACGAGCACC TGTCAAAGCCT AAAGT CACCAT GGG T C TG CAGAGCA 
AT AAGAAT GG CAC C T G T G T GAC C AAT C T GACAT G C T G CAT GGAAC AT G G G G AAG AG GAT G T GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGAT GGGGAGAAAGTGATAT G AC CT TCAT C TGCGT T GCCAGGAAC C C TGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T TC T GAAG AGAG AGAGAC AAGAAG AG TAC AT T G AAG AG AAG AAGAG AG T GGACAT T TGTC GGG 
AAAC T C C T AAC AT AT GC C C C CAT T C T G G AG AG AACACAG AG TAC GAC AC AAT CC C T CACAC T AAT 
AGAACAAT C C T AAAG GAAG AT C CAG C AAAT AC G G T T TAC T C CAC T G T G G AAAT AC C G AAAAAG AT 
G GAAAAT C CC CAC T CAC TGC TCACGAT GCCAGACAC AC CAAGGC T AT T T GC CTAT GAGAAT GT T A 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAZ^ESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPIL7VRKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNI CPHSGENTE YDTI PHTNRT I LKEDPANTVYSTVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 



N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
AT AG GAGC AG G T C T GAT GG C CAT T C C AGCAAC AACAAT G T C C T T G AC AG CAAGAAAAAGAG C G T G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AAC AG T AAT G C C AAT T G T G AAT T T T CAT T GAAAAAC AT C AG T G AC AT T CAT C C AGAAT C C T T C AA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
AC AC C AT GG C GAG T G G C T GGAGAG C AT C TAG T T T C C AC T T C GAT T C T G AAG AAAAC AAAC AT AG G 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T TAAT GGGAATAAAAT GTAAGTATCAGTAGTT TGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAI PLIVSLVEEDQFSQNPISCFEWWFPGI IGAGLMAI PA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

CAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AG G AGC AG T GAG GAAC C T GC AGAG T C AC AC AG T T G C T G AC C AAT T GAG C T G T GAGC C T G GAG C AG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGC TGAG CACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE !?o 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHS PVPEKAI PL I TPGS AT TC 

Important features : 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



G TGGAC TC TGAGAAGC C CAGGCAG T TGAGGACAGGAGAGAGAAGGC T G C AGACC CAGAGG GAGG G 

AG G AC AGG GAG T C G GAAGG AG GAG G AC AGAGG AG GG C ACAG AG AC GC AG AGC AAGG GCG G C AAG G 

AG G AGAC C C T G G T G GGAGGAAGAC AC T C T GGAGAG AG AG G G GG C T GG G C AG AGATGAAG T T C CAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGC AGC TT TGGAAT GAAT CCTCAGGGAG C TCC CT GG G G T CAAG GAGGCAAT GGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CTT CAGAGGACAGGGAGT T TCCAGC AACATGAGGGAAAT AAGCAAAGAGGGCAATCGCC TCC T T G 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

G7\ATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

G CAT C C C G TGAC C T C C AGAC AAG GAG C C AC CAG AT T G GAT G G GAG C C C C C AC AC TCCCTCCT T AA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *2 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 
Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
T T GCAC CCAAGGAT AAT C T C T T C AT CAGG T TCC TGAAGCCC TGGC TGGGAGAAGGGATAC TGCT G 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
G AAG TCC TAT AT AACGATC T TCAACAAGAGT GCAAACAT C ATGC T T GAC AAG TGGCAGCACC TGG 
C C T C AG AG GG CAG C AG T C G T C T G G AC AT G T T T GAG C AC AT C AGC C T C AT G AC C T T GGAC AG T C T A 
CAGAAATGCAT C T T CAGCT T TGACAGC CAT T GTCAGGAGAGGCCCAG T GAATATAT TGCC ACCAT 
C T T G GAG C T CAG TGCCCTTG T AG AG AAAAG AAGC CAG CAT AT C C T CC AG C ACAT G GAC T T T C T G T 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
C AT TGTCAGATGAGGATATAAGAGC AGAGGC TGACACCT T C ATGT T T G GAG G C C AT GAC AC C AC G 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGG TGCAAGAGC T T C TGAAGGACCGCGATCC TAAAGAGAT T GAAT GGGACGAC CT GGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAG TGA CTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 

MSLLSLPWLGLRPVAMS PWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRISrWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 

GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 

TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 

TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 

ATAA T T CAT GG C CAG T T T TAT GAAGC T T T G GAAG GC AC TAT GGAC AG AAG C T GG T G GACAG T T T T 

GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 

G T GAT T CGAAC AT T TGAGG GT T AC T T T T G GAAGC AACAAT AC AT TC T CG AAC C T GAAT G T CAG T A ' 

GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 

T C T G GAT G T T G T CC C AC T GAAT T C C CAT G AAT AC AAAC C T AT T C AGC AAC AG C AAAAAAAAAAAA 

7VAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MGPVKQLKRMFEPTRL I AT IMVLLCFALTLCSAFWWHNKGLAL I FC I LQSLALTW YSLS FI P FAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 52 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGC^CCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 

G GAGATAAGGAC CTT GAT GGG CAGCT AGACT TT GAAGAAT T T GT CCATTAT CT CCAAGAT CAT GAGAAGAAGCT GAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAATT CTCAAGAGCAT GGATAAAAACGGCACGATGACCATCGACT GG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 

ATCTTTGAT GTGGGTGAGAATCTAACGGTCCCGGAT GAGTTCACAGT GGAGGAGAGGCAGACGGGGAT GTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 

T CACT CT G GCG GGG C AAT GGCAT CAACGT C CTCAAAAT T GC CC C CGAAT C AGCCAT C AAATT CAT G GCCTAT GAG C AG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
GGCAT CATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
AT AAT CCAT GAT GAAAGGT GAGGT CACGT GG CCT C CCAGGCCT GACT T CCCAAC CTACAGCAT T GAC G C CAACTT G GC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 

ACTGTT G G GAAAAGG GTTT TGT C CAGAAG GACAAG C C GGACAAAT GAGCGACTT CTGT GCT T CCAGAG G AAGAC GAG G 
GAGC^GGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCC^LACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
AACTATTT TTATAGAT TT GTTT AATT AAT AGCTT GT C ATTTT C AAGT T CAT T TT T TAT TC AT ATT TAT GT T C AT GGT T 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAATCTTCAATAGGATGCAAAGAT CAATGCAAAAATTGTTATATATGAACATATAACTGGAGT CGTCAAAAAG 
CAAATTAAGAAAGAATT GGACGT TAGAAGT T GT CAT TTAAAGCAGC CTT CT AAT AAAGT T GTT T CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSI FKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKI LDKKNDGRI DAQE IMQSLRDLGVKI SEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE I I LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKG YVPNMLG 1 1 PYAG I DLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSS T C 
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYV 
VYENLKI TLGVQSR 

Important features : 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino 'acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 



App_ID=10063514 



Page 210 of 320 



•i « I G & 3 B .1 ■ | „ O S G :l Q 2 



FIGURE 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
C AC T CATCAT T G GC T T T GG TAT T TC AGGGAGAC AC T C CAT CACAG T CAC TACT G T C GC C T CAGC T 
GGGAACAT TGGGGAGGATGGAAT CCT GAGC T GCACTT T T GAACCTGACATCAAACT T TCT GATAT 
C G T G AT AC AAT G G C T G AAG GAAG G T G T T T T AG GC T T G G T C CAT GAG T T C AAAG AAG GC AAAG AT G 
AGCTGT CGGAG CAGGAT GAAAT GT T CAGAGGCCGGACAG CAGTGT T T GCT GAT CAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAAqTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CAC T T C T AAAG G C AAG GG GAAT G C T AACC T T GAG T AT AAAAC T G GAG C C T T C AG CAT G C CG G 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
C AGCCC AC AG T GG T C T G GGCAT C CC AAG T T GAC C AGGG AGCCAACT TC TCGGAAG TC TCC AATAC 
C AGC T T T GAG C T GAAC TC TGAGAAT G T G ACCATG AAGG TTGTGTCTGTGCTC TACAAT GT TACGA 
T CAACAACACAT AC TCC TG TAT GAT T GAAAAT GACAT T G C CAAAGCAAC AG GGG AT AT CAAAG T G 
ACAGAATC GGAGAT CAAAAGGCGGAG T CACC T AC AGC T GC TAAACT C AAAGGCTTC TC T GTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCCTTGGCCACAAAAAAGC AT GCAAAGTCAT TGT TACAACAGGGATCTACAGAACTAT T TCAC 
CAC C AG AT AT G ACC TAG T T T TAT AT T T C T GG G AG GAAAT GAAT T CAT AT C T AGAAG T C T G GAG T G 
AG C AAAC AAG AGC AAG AAAC AAAAAG AAGC CAAAAG C AGAAGGC T C C AAT AT GAAC AAGAT AAAT 
CT AT C T T C AAAGAC AT AT T AGAAG T T G G GAAAAT AAT T CAT G TGAAC TAGACAAG TGTG T TAAG A 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T G T AAT G T T G C T C T GAG GAAG C C CC T G GAAAG T C TAT C C C AAC AT AT CC AC AT C T TAT AT T C CAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT T CCCAAC T GAC AAAT G C C AAAGT T GAG AAAAAT GAT CAT AAT T T T AGC AT AAAC AG AG C AG T 
C G GGGACAC C GAT T T TAT AAAT AAAC T G AGCACC T T C T T T T T AAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MA5LGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

TGACGTCAGAATCACC ATGG CCAGCTATCCTTACCGGCAG6GCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T G GTCT TCATTC AATGAT GAGACCT GCCT CATGATGAT AAACATG T T T GACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
G G CC T T C C GGGAG AAGGAC AC AGC T G T ACAAG GC AAC AT C C GG C T C AG C T T C GAGGAC T T C G T C A 
CCATGACAGCTTCTCGGATGCT ATGAC CCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C T G AC C AAGC AT GAGAG AG AT C T G T C T AT G GG AC CAG T G G C T T GG AT T C T G CCAC AC C CAT AAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T T GG GG CC AAAAG T CCAG T GAAAT T G T AAG C T T C AAT AAAAGG AT GAAAC TC T GA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 

S GY I SMKELKQALVNCNWS S FNDETCLMM I NMFDKT KS GRI DVYGFS ALWKFIQQWKNL FQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 

DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-33, 83-88, 87-92, 110-115 
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FIGURE 6a 

CAGGATCCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

AT C T GGGG TAT C AATAATCAACC C C CAG G T AC CAG C T G GG G AAAT AT T AAT C GG T ATC CAG GAG G 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGC TAGA GCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGCTCATCTCGPJVTAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of* the protein: 
Signal peptide: 

amino acids 1-2 6 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATG GGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGAT C T C AACACC AT G T T GT C T GC AACAC ATGA CAGC CAT T G AAGC C TGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation, site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGC TATTCAC T T TTATAT ATTTATATAAAATTAG 
TAG T GAGAT G T AAAAAAAAAAAAAAAAAA 
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FTGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 3 6-4 0 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

ammo acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
T CAAATAC TTC C T TC ATT AAGCT GAATAAT AAT GGCTT TGAAGATAT T GT C ATT GTT ATAGATC CTAGT GTGCCAGAA 
GATGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGA 
TTTTTTTT CAAAAAT GTAT CTAT ATT AAT T CCT GAGAATT G GAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AACCAT AAACAT G CT GAT GT TAT AGTT GCAC C AC CT ACACT CC C AGGT AGAGAT GAAC CAT ACAC CAAGCAGTTC ACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
T ACC GT GCT AAGT CAAAAAAAAT CGAAGCAACAAGGT GTT C C GCAGGT AT CT CT GGT AGAAATAGAGT TTAT AAGT GT 
CAAGGAGGCAGCTGT CTTAGTAGAGCATGCAGAATT GATT CTACAACAAAACTGTATGGAAAAGATTGTCAATTCTTT 
C CTGATAAAGTACAAACAGAAAAAGCATC CATAATGTT TAT GCAAAGT ATT GAT T CTGT TGT T GAATT T T GT AAC GAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
T CTGAGGATTTTAAAAACACCAT ACCCAT GGTGACACCACCTCCTCCACCT GTCTTCTCATT GCT GAAGATCAGT CAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACATTT CCTGCTGCAGACTGT T GAAAAT GGAT CCT GGGT GGGGAT GGTT CACT TT GATAGTACT GCCACT ATTGTA 
AAT AAGCT AATCCAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT G GCAG GAT T AC CTACAT ATCCT CT GGGAG GA 
ACTTCCATCTGCTCT GGAATTAAATATGCATTT CAGGTGATTGGAGAGCTACATTCCCAACT CGATGGATCCGAAGTA 
CT GCTGCT GACT GAT G GGGAGGATAACACT GC AAGTT CTT GTAT TGATGAAGTGAAACAAAGTGGGGCC ATT GTT CAT 
T T TAT T GCT T TGGGAAGAGCTGCT GAT GAAGCAGT AAT AGAGAT GAGC AAGATAACAGGAGGAAGT CAT T TT TAT GT T 
T CAGAT GAAGCT CAGAACAATGGCCT CATT GAT GCTT TTGGGGCT CT T ACAT CAGGAAATACT GATCT CT CC CAGAAG 
T C CCTTCAGCTC GAAAGTAAGGGATT AACACTGAAT AGTAAT GC CT GGAT GAACGACACTGT CAT AAT T GAT AGT ACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 
T G G GC ATAC AAT C TT C AAGC C AAAGC GAAC C CAGAAACATT AACTATT ACAGTAACTT CT C GAGC AGC AAAT TCTT CT 
GT GCCTCCAATCACAGTGAATGCTAAAAT GAAT AAGGACGT AAACAGTTTCCCCAGCCCAAT GATTGT TTAC GC AGAA 
ATTCT ACAAGGAT AT GTAC CTGTTCTT GGAGC CAATGT GACTGCT TT C ATT GAAT CACAGAAT GGACAT ACAGAAGTT 
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATT CTTTCAAGAATGAT GGAGTCTACT CCAGGTATTTT ACAGCATAT 
ACAGAAAAT GGCAGAT ATAGCTT AAAAGTT CG GG CT CAT GGAG GAG CAAACACTGCCAG GCT AAAATT ACG GCCT C GA 
CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT 
GAGGATACT CAGAC C AC CT T GGAGGAT TT C AGCC GAACAGC AT C CGGAGGT GCAT TT GTGGT AT CACAAGT CCCAAGC 
CT T C C CTT GCCT GACC AAT AC CCACCAAGT CAAAT CACAGACCTT GAT GC CACAGTT CAT GAGGATAAGAT T ATT CT T 
ACAT GGAC AG CACCAGGAGATAAT TT TGAT G TT G GAAAAGT TCAAC GT TAT ATCAT AAGAAT AAGTG CAAGT ATT CTT 
GATCT AAGAGACAGT T TT GATGAT GCTCTT CAAGTAAATACTACTGAT CTGTCACCAAAGGAGGCCAACT C CAAGGAA 
AGCTT T GC AT TT AAAC CAGAAAATAT CT C AGAA GAAAATGC AAC C CAC AT ATTT ATT GCCATTAAAAGT AT AGAT AAA 
AGCAATTTGACATCAAAAGTATCCAACATTGCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
CCTACACCTACT CCTACTCCTACTCCTACT CCT GATAAAAGTCATAAT TCT GGAGTT AATAT T T CTACGCT GGTATTG 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTAACGAAGAAAAAAATCTTC 
AAGTAGACCTAGAAGAGAGT TTTAAAAAAC AAAACAAT GT AAGT AAAGGAT ATTT CT GAAT CTT AAAAT T C ATCC CAT 
GT GT GAT CAT AAACTCATAAAAATAAT TTT AAGATGT CG GAAAAGGATACTTT GATT AAAT AAAAACACT CAT GGAT A 
TGTAAAAACTGTCAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAAT TAAAAT CAT CTAT CT GAGTAGT CAAAAT ACAAGTAAAGGAGAGCAAATAAACAACAT TT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVI IDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSK3SLAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYSRY FT AYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE I EANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide : 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 

CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGC CAC CT CCTAAC CTT CCCT T CTT GAAC CCC C AGTTAT GCC AGGAT T T ACTAGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGT AGAGAAGCTGCT C TGT GT GGTGGTTAACT C C AAGAGGC AGAACT C GT T CTAGAAGGAAAT GGAT GC AAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
CCT TT CGAT AGCT TT ACT CTACAGAAG GTGTAC CAGCTGGAGACTG GCCTT AC C C GCCAC C CCGAGGAGAAGCCT GTG 
AGGAAGGACAAGCGGGATGAGTT GGTGGAAGCCATTGAAT CAGCCTT GGAGACCCTGAACAATCCTGCAGAGAACAGC 
CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
GT GAAAAAT GAAAAGCTCAACAT GGCCAACAC GCTT ATCAAT GTTAT CGT GCCT CTAGCAAAAAGGGT GGACAAGTT C 
CGG(^GTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAATAAATGAAGTCAAAG GAAT ACTT GAAAACACTT C CAAAGCT GCCAACT TC AGGAACTT T ACCT TC ATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCTGGT CATAAAGAAGGAAACTGGATTT T GGAGAGACTT TGGATTT GGGAT GACGT GT CAGTAT CGGT CAGAC 
T T CAT CAAT ATAGGT GGGT T T GATCT G GACATCAAAGG CT GG GGCG GAGAG GAT GT GC ACCTT T ATC GCAAGTAT CTC 
C ACAGCAAC CT CAT AGTGGT ACG GAC GCCT GT GCGAG GACT CTT CCACCT CT GGCAT GAGAAGCG CT G CAT GGAC GAG 
CT GAC CC C C GAG CAGT ACAAGAT GTG CAT GCAGT C C AAG GC C AT GAAC GAG GCATC C C AC GGCCAGCT GG GCAT GCT G 
GT GTT CAGGCAC GAGATAGAGGCT CACCTT CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACT CCCAGA 
GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
GGACGACAAAAGAATT GGACTGAT G GGT CAGAGAT GAGAAAGCCT C C GATTT CT CTCT GTT GGG CT TT TTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATTCAT GATT TAAGAGCAGT TTT GTAAAAAATT C ATT AGCAT GAAAGGCAAGCAT ATT T CTC CTC AT AT GAATGA 
GC CTAT CAGCAGGGCT CTAGTTT CTAGGAAT GCT AAAATAT CAGAAG GCAGGAGAGGAGAT AGGCT TATT AT GAT ACT 
AGTGAGTACATT AAGT AAAATAAAAT GGACCAGAAAAGAAAAGAAAC CAT AAAT AT C GT GT CAT AT TT T CC CCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
AT T TT TAT ATTT T TT AAGAAGAT ACTTT G AGAT G CAT TAT GAGAACTTTCAGTT CAAAGCATCAAATT GAT GCCATAT 
CCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AATACAGAC GTACAGATACTTTCTCT GAAGAGT ATTT TC GAAGAGGAGC AACTGAACACTG GAGGAAAAGAAAAT GAC 
ACTTT CT GCT TT ACAGAAAAGGAAACTCAT T CAGACT GGT GAT AT CGT GAT GTACCT AAAAGTCAGAAACCACATTTT 
CT CCT CAGAAGTAGGGACC GCTT T CTTACCTGTTTAAAT AAACCAAAGT AT ACC GT GT GAAC CAAACAAT CT CTT T T C 
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGATC AAT C CAT C T GC CAGAAT C TAGT G GGAT GGAAGTTT T TGCT AC AT GT TAT C CAC C C C AGGC C AGGT GGAAG 
TAACT GAAT T ATTTTT TAAATTAAGCAGT T CTACT CAAT CAC CAAGAT G CT TCT GAAAATT GCATTTTATTACCATTT 
CAAACTATTTTT T AAAAAT AAAT ACAGTTAACAT AGAGT GGTTT CTTCATT CAT GT GAAAATT ATTAGCCAGCACCAG 
AT GCATGAG CTAATTATCT CTTT GAGTCCTTGCTTCT GTTTGCT CACAGT AAACTCAT T GTTTAAAAG CTT CAAGAAC 
ATTCAAGCT GTT GGTGTGTTAAAAAATGCATTGTATT GATTTGT ACT GGT AGTTTAT GAAATTT AATT AAAACACAGG 
CCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 72 

MMMVRRGLLAWISRWVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
T L I NV I VP LAKRVDK FRQFMQN FREMC I E Q DGRVHL TWY FGKE E I NE VKG I LEN T S KAAN FRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

ammo acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Ami da ti on s i te . 

amino acids 377-381 
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FIGURE 73 

GAG AC T G C AG AGG G AG AT AAAGAG AG AG GG C AAAG AG G C AG CAAG AG AT T T GTCCTGGGGAT CCA 
GAAAC C CAT GAT AC C C T AC T G AAC AC C G AAT C CCC T GG AAG CC C AC AG AG AC AG AG AC AG C AAG A 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAG TCGCCCAT CGATAT TCAGACAGACAGTGTGACAT T TGACCC TGAT T T GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T G AAGCCACAT T T G CAG AG C T C C AC AT T GT AC AT TAT GACT C TGAT T CC T AT GACAGC T T GAG T G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AAT AT AGC T TAT GAAC AC AT T C T GAG T C AC T T GC AT GAAGT C AGG C AT AAAG AT C AGAAG AC C T C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CT C CT TAG GAT AAAG AG T TGC TGT T G AAG T T G TAT AT T T T TGATC AAT AT AT T TGG AAAT T AAAG 
TTTCTGACTTT 



AppJD=10063514 



Page 225 of 320 



,J„ 4j„,m a "u ..a., 

J 



FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 

TCTGGTGGT T TG CC T AAACC TGCAAAC ATCACC T TC T T AT CCAT C AACATGAAGAATGT C C T ACA 

ATGGAC TCCACC AGAGGGT C T T C AAGG AGT T AAAG T T AC T TACAC TGT GCAG TAT T TC AT CACAA 

AT T G GCCCACCAG AGGT GG CACT GAC T ACAGAT GAGAAG T CCAT T TC TGT T GT CC TGACAGC T C C 

AGAG AAG T GGAAG AG AAAT C C AG AAG ACC TTCCTGTTT C C ATGC AAC AAAT AT AC T CC AAT C TG A 

AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 

ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 

GAT T T T GAT TTATGGAAAT GAAT TT GACAAAAGAT T CT T T G TGCC TGCT GAAAAAAT C G T GAT T A 

ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

G G AAGAGG AG G T G AAAC AT T TAG G G T AT GC T T CG C AT T T G AT G GAAAT T T T T T G T GAC T C T G AAG 

AAAACACGGAAGG T AC T TC T C T C AC CC AGC AAGAG T CCC T CAGCAGAACAAT AC CC CC GG AT AAA 

ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 

TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

CCAGACAGGCCACCAGGAGAAAATGAAACC TAT C TCATGCAAT TCAT GGAGGAAT GGGGGT T AT A 

TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 

TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 

CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 

GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AG C AG T C AAT AAAT G T T T G C C AG AC T G G G T GCAG AAT T TAT T C AG GTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSE FKAK 1 1 FWYVLP I S I TVFL ITS VMGYS I YRY I HVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-2 8 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GAT CAATGAAAAGGC T GC AG AT AAGC T GGGAT C T ACC C AGATCG TG AAGAT CC T AAC T C AGG AC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T T AC ACC AAAG GT GAC CAAC T T AT AC TCAAC T T G AAT AACAT CAGC T C T G ATCGG AT CC AGC T G 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATL I QATLS PTAVL I LGPKVI KEKLTQELKDHNATS I LQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDC^TSHGSLRIQLLYKLSFLTOALAKQVmLLVPSLPNL 
VKNQLC PVI EAS FNGMYADLLQLVKVP I SLS I DRLE FDLL YPAI KGDT IQL YLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DRI QLMNSG I GW FQPDVLKNI I TE 1 1 HS I LL PNQNGKLRS GVPVS LVKALG 
FEAAE S S L T KDALVLT PAS LWKPSS PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



G AG AG AAG T C AG C C T G G C AG AGAG AC T C T G AAAT G AGG GAT T AG AGG T G T T C AAGG AG C AAG AG C 
TT C AGC C TGAAG AC AAGGG AGCAG T C C C T GAAGACGCT TC TAC TGAGAGGT C TGCCATGGCC TC T 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
C C AAGG G C C T C T G GAT G G AAT G T G C C AC AC AC AG C AC AG G CAT C ACC C AG T G T G AC AT C T AT AG C 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
G T CAGAAGG TGCTGCT GAG GAT AG AC T GAC T T T G GC CAT TGGAT T GAGCAAAGG CAGAAATG GGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
C C T CCAAAGAAACT GAT T GGCCC TGGAACC T C CATC CCACT C T TG TT AT GACTCCACAG T G T CCA 
GACTAAT TT G T GCATGAAC T GAAAT AAAACCAT C CT ACGGTATCCAGGGAACAGAAAGCAGGAT G 
C AG GATGGGAGG AC AG GAAGGCAGC C T GGGAC AT T T AAAAAAATA 
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FIGURE 80 



MASLGLQLVGY I LGLLGLLGTLVAMLLPS WKTSS YVGAS I VTAVGFSKGLWMECATHS TG I TQCD 
I Y S T LL GL PAD I QAAQAMMVT S S AI S S LAC IIS WGMR CTVFCQES RAKDRVAVAG GVF F I L GG L 
LGFI PVAWNLHG I LRD FYS PL VP DSMKFE I GEALYLG IISSLFSL IAG 1 1 LC FS CS S QRNRSNY Y 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide : 

amino acids 1-24 

Transmembrane domains : 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 8l 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATG GTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
C CATGGGAAGAGAT ACT CCCCCG GCGAGAGC TGGCACCCC TAC T T GGAGCCACAAGGCCT GATG T 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACT G CC AG AC T C C T G C T GCC AAG C C T G C AAAG AT GAG G C AAGT GAGC AAT C GGATGAAG AG G AC A 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

C AG AC C C AAG G GAG C AG GC AG C AC AAC T G T C AAG AT CGT C C T G AAGG AG AAAC AT AAG AAAG C C T 

/ 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C T C G GAC T T G G T G GAG AT C T ACC T C T G GAAGC T G G T AAAAG AT G AGG AAAC T GAGG C TCAG AG AG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
G AAG C AAG AC T T C C AG AAAG AGG C AC AG C AC T T C C GAC TGCTCGCTGGCCCC C AC G AAG G T C AC T 
GGAACG T CT T CC TAGCC CAGACCC TGGAGC T GAAG GT C AC GGCCAGT CC AGACAAAGTGACCAAG 
ACATAAC AAAGACC TAA CAGT TGCAGATATGAGC T GTATAAT TGT T GT TAT TAT AT AT TAAT AAA 
T AAG AAG T T GC AT TAC C C TC AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S STRCPKAPGRVLVHTS VS P S PDNLRRFALEHEAS DLVE I YLWKLVKDEE TEAQRGEVPG PRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGT GAGGTCCATTCAC AGAACACAT CCATGGCTCT CATGCTCAGTTT GGTTCTGAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT AC GT GACT T T GT CTCCCG AT CAT G GGT ACT GGGT CC TC AG AC T GAAT GGAGAAC AT T T GT AT TT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAAT GTAGG ATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACC GT C AGG AAT T CC C AT C TC AC AGGC TG TG GT GT AGAT T AAGT AGACAAGG AAT GTG AAT AATGCT TAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE Sj. 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAEL'RDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI CPVTQESEKEASWQRASAI PETSNS ES S SQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 2 39-255 
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FIGURE 8 s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATG CTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
G G GAC C CACAT ACC AAG AAT T GC AC C C T GAGC AT CAGAG AT GC CAGAAGAAG T GAT GC GGGG AGA 
T ACT T C TT TCG TAT G GAGAAAGGAAG T ATAAAAT GGAAT T ATAAACAT CACCGGC TCTCT G T GAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GAC G GC AC AG TAT C C AC AG T C T T G G G AAAT GGC T CAT C T C T G T C AC T C C C AG AG GG C C AGT C T C T 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGA GAAACTGCAGAGACTCACCCT GAT TGAGGGAT CACAGCC CC T C CAGGCAAGGGAGAAGTCA 
GAG G C T GAT TC T T GT AG AAT T AAC AG C C C T C AAC G T GAT GAGC T A T GAT AAC AC TAT G AAT TAT G 
T GC AGAGTGAAAAGCACACAGGC T T TAG AG T C AAAG TAT C TCAAACC TGAATCCACACT GTGCCC 
T CCC T T T TAT TTTTT TAAC TAAAAGAC AGACAAAT T CC T A 
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FIGURE 86 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWI YPGPVVHGYW FREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LG S QQVYLNVS LQ S KAT S G VT QG WGGAGAT AL VFL S FC V I FWVR S CRKKS AR PAAG VGD T G I E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 



Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
C C AAGG AAAG T G C AG C T GAGAC T C AG AC AAGAT T AC AATGAAC C AAC T C AG CTTCCTGCTGTTTC 
T CAT AG CG AC CAC C AG AGG AT GG AG T ACAGAT GAG GC TAATAC T TAC T TCAAGGAAT GGACC TGT 
T CT TCGT C TCCATC TCTGCCCAGAAGC TGCAAGGAAATCAAAGACGAAT G T CC TAG T GCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
C AAC TACAACACC T T TGGAT C TGCAGAGGCGGCCACGAGCG ATGAC TACAAGAACC C TGGC TACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAAC AGC TCCCTGCT GAGG T ACC G C AC GGAC AC TGGCTTCCTC C AGAC AC T G GG AC AT AAT C T 
G T T T GG C AT C T AC C AGAAAT AT C CAG T G AAAT AT G GAGAAGGAAAGT G T T G GAC T G AC AACG GC C 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGC GGGAAT T CAC T GCGGGAT TTGT TCAGT T C AGGGTAT T T AAT AAC GAG AG AGC AG C CAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGAGAGT T T TGT GGG AGGGAAC CC AGACC TC TCCT CCCAACCATGAGAT C CCAAGGAT G GAGAA 
C AAC T TAC C CAG TAG C TAG AAT G T T AAT GG CAGAAGAGAAAAC AAT AAAT CAT AT T GAC T CAAGA 
AAAAAA 
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FIGURE 88 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKS PMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

GlYcosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8Q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATGG GCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
T GAC GG GAAC C T AAGGAC GAAGC C T AC AGGGGC AAG GGCCGCTTC G T AT T C C T G GAAG AAG G AAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TC T TGAGT CT GGGAT T ATCCGCATT G TAT T TAGTGC T TT GTAATAAAAT AT G T T T TGTAGTAACA 
T T AAG AC T TAT AT AC AG T T T TAG G G G AC AAT T AAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Ql 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAG ATG CAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
C C AACAC AAT G C C AC AG GC C AG G GAAAC AT C T CC AT C AG CCTCGTGCCCCC CAG T AAAG C T G TAG 
AG T T C C AC C AG G AAC AG C AG AT C T T CAT C G AAG C C AAG G C C T C C AAAAT C T T C AAC T G C C G GAT G 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGG TGAC CCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
T C C C CGAGGC CT G T G G G CAGG CC GAT CAG T GT GG C C CC AGAT C AAG T CATGG GAGGAAG C T AAG C 
CCTTGGTTCTTGC CAT C C T G AGG AAAG AT AGC AAC AG G GAG G G GG AG AT T T CAT CAG T G T G G AC A 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

ammo acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q?t 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC T CTGCC TGC TCT GC CAAGACAAGAAC T T T C T TCT T T ACAACCAGCGC TC CAGAT 
AAC C T C AG GGAACC AG C AC T T CC C AAAC C GC AGAC T AC AT C T T T AGAG G AAG C AC AAC TG T G CC T 
T T T T C T GAAAAT CCCTTTTTCTGGT GG AAT T GAGAAAG AAAT AAAAC TAT GCAG AT A 
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FIGURE Q4. 

MT AAV F FGC AF I AFG P ALAL Y V FTIAIEPLRIIFLI AG AF FW L VS L L I S S L VW FMAR V 1 1 DNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FS FVNTLSDSLGPGTVG IHGDSPQFFLYSAFMTLVI I LLHVFWG I VFFDGCEKKKWGI LL I VLLT 
HLLVSAQTFISSYYGINLASAFI ILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Qft 

AAT T T T T CACCAGAGT AAAC TT GAGAAACCAAC T GGACC T T GAG TAT TGTACAT T T TGC CTCGT G 
GACCCAAAGGTAGCAATCTGAAAC ATGA GGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
G AAC AC T AC C AAAC C AAC AG C AG T CAAAT C AGG TCTTTCCTTCTT T AAG T C T G AT ACC AT T AACA 
C AG AT G C T C AC AC T GG GGC C AG AT C T G CAT C T G T T AAAT C C T G C T G C AGG AAT G AC AC C T G G T AC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
C AAT T T T T G T C ACAC AAC T T G G AGC C C AG GG C AC TAT CC T AAG C T C AG AG GAAT T G C C AC AAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAA GCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT GGAAT AGAT T G AGAC AC AT T G GAT AG TCT TAGAAGAAAT T AAT T C T TAAT T 
TAC C T GAAAATATTCT TGAAAT T TCAGAAAAT ATG T TCT AT G T AGAGAATCCCAAC TTT TAAAAA 
C AAT AAT T C AAT G GAT AAAT CTGTCTTT G AAAT AT AAC AT TAT G C T GC CTGGAT GAT AT G CAT AT 
TAAAACATAT T T GGAAAAC T GG AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTS QAG AN PD VQDG S L PAG GAGVNPAT QG T P AG RL P T P S G 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide : 

amino acids 1-16 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
G C AG GC AGAG T ACC AGC T AC AG G T C AC CC T G GAGAT G CAGG AT G GACAT GTCTTGTGGGGTC C AC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
G GAT G AG CC AG G CAC AGC C AAC T C G GAT C T T C GAT T C CAC AT C C T GAGC C AG GC T C CAG C CCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
AC C AGC C T T G AC CAC G C C C T G GAG AG G AC C T AC CAG C T GT T GGT AC AGG T C AAG GACAT G G G T GA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
C CC TAG AG C C TAT C CACC T G G C AG AG AAT C T CAAAG T C C TAT ACC C G CAC CAC AT G G C C CAG G T A 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGT C CACCAGG T AC T GAAGTGAC TAGAC TG TCAG CAG AG GAT G CAGAT GCCCC CGG CTC CCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
T CG AAG T CG C AG T C AC AGAT AT CAAT GAT CAC GC C CC T GAGT T CAT CAC T T C C CAGAT T G GGC C T 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACC CAC TGGACC AT G T CAAGGAAGAAGGACCCG GATCAACCAGCAGACAGCGTG CCC CT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGG GAGAG AGC C CAG CAC C CAAG AT C CAGC AG G G G AC AG GAC AGAGT AG AAG C C C C T C C A 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFL FLEAS DRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T S LDHALERT YQLLVQVKDMGDQAS GHQAT ATVEVS 1 1 ES T WVS LE P I HLAENLKVL Y PHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVP I C P PRDP TVS I PE L S P P G T E VTRL S AE DADAP G S PN S HWYQL L S PE PE DGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNI LLLVLAMDLAGAEGG FS STCE VEVAVTDINDHAPE FI TSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQS VAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEAS VP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PVWS HNAQMWQLLVRV I VCRCNVEGQCMRKVGRMKGMP T KL S AVG I LVG T LVAI G I FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide : 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT C TAG G AC CCAGG CAT C TTGCTTTC CAGC CAC AAAGAGAC AGATGAAGAT G CAGAAAGG AAAT G 
T T C T C C T TAT GTTTGGTC T AC TAT T G CAT T T AG AAG C T G C AAC AAAT T C C AAT GAG AC TAG C AC C 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
G CAT AG T CAC CAAC T C T GAGT T C C AT ACAAC C T CC AGT G G GAT C AGCAC AGC C ACC AACT C T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
G G C CAG CAC AGC C ACC AAC T C T GAG T C CAGC AC AC C CT C CAG T G GG G C CAG C AC AG T CAC CAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAAC T C T G AC T C CAGC AC AAC C T C CAG T G GG GC TAG CAC AG CC AC CAAC T C T GAG T CC AG CACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
G C CAC CAAC T C T GAG T C CAG CAC AAC C T C CAG T GG G G C C AG CAC AG C CAC CAAC T C T GAG T CC AG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
G G C TAG C ACAGCCAC CAAC T C T G AC T CCAGC AC AAC C T C CAG TGGGGCCG GC AC AG C CAC C AACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA' 
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G T G G AG T C C T AAC T G G T T C T GG AG GAG AC CAG TAT CAT C GAT AG C CAT GG AG AT GAG C G G GAG GA 
ACAGC GGGC CC TGAGC AGCC CC GGAAGCAAG T GCCGC ATTC TT CAGGAAGGAAG AGAC CTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T T G AAG AAGG TAT T C C TC AC CTTTCTTGCCTT T AC CAG AC AC T G GAAAG AG AAT AC TAT AT 
T G C T C AT T T AGC T AAG AAAT AAAT AC AT C T C AT CT AAC AC ACACGACAAAGAGAAG CTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C T GG C AT T C AAAA T C T C CAC AG T AAAAT C C AAAGAC C T C AAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE loo 

MKMQKGNVLLMFGLLLHLEAATNSNETS TSANTGSSVI SSGASTATNSGSS VTSSGVS TAT I SGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTT5SGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGAT GAT G T GAT AT ATG C AT T C CAG G G GAAG GG AAAT TGT GGTGCTTCT G AACC CAT G G T CAAT T 
AAC GAGG CAG T T T C T AGC T AC T GC AC G T AC T T C AT AAAGCAGG AC T C T AAAAG C T T T G G AAT CAT 
GGT G T CATG GAAAGGGAT T T AC TT TATAC T GACTC TGT T T T GGGGAAGC T T T T T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TAT AAC T GG GGAT GC AT TTGTTCCTG GAGAAAG AAG TGT CAT T ATCATGAACC AT C G GACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C T T CAT T CAT AG GAAA T G GAAG GAT G AC AAG AG C CAT T T C GAAG AC AT GATTGATT AC T T T T GTG 
AT AT T CACG AAC C AC T TCAAC T CCT CAT AT T CCCAGAAGG G AC T GAT C TC AC AGAAAAC AGCAAG 
TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
AT AT C AC T GT GG C G TAT CCT C AC AAC AT T C C T CAAT CAG AG AAG CACC T C C T C C AAG GAG AC T T T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAAT AT T T G G T G GAC T GG AG AT CAT AG AAC T T G CAT GT T AC C G AC T T T T ACACAAACAG C C AC AT 
T TAAAT T C AAAGAAAAAT GAG TAAG AT T AT AAGGT T T GC CAT GT GAAAACCT AGAGCAT AT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGG C CAG AAT AT TAT T AAAC AAT C AT C AGG C T T T T AAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATG GCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
C AC C T C T AC AGAGAT GGGGAAGACT GGGAAT C TAAGC AG AT GC CAC AG TAT C GAG GGAGAACT GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATC C AAAGGG AAAAT C C AG GC G G AAC T GG AC T G G AG AAG AAAGC AC GG AC AG G C AG AAT TG AG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T T CT GAT C TGAAAAC T G TAACCC AT AGAAAAGC TCC CC AGGAGGT GCC TCAC T C T GAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GAC AT GT C AG T T T GAAG GC T T G T T G AGAC C C TAT AT C C AG CAT GC GAT G TAT GAC GAG GAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
C C CAC AC CAC AGAC CCAG AC AC AG CC AAG GG AG AG T G C T C C CG AC AGG T G G C C C CAG CTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAG AAG C C AT G GC T GCC C T GAAG T G G G GAC GGAAT AGAC T CACAT TAGG T T TAG T T T G T GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
AC AG T GAT TCCTGCCT C ACAGG T GAAG AT T AAAGAGAC AAC G AAT G T G AAT CAT G C T T G CAG G T T 
T GAG G G CAC AG T GT T T GC T AAT G ATG T GT T T T TAT AT TAT AC AT T T T C C C AC C AT AAAC T C T G T T 
T G C T TAT T C CACAT T AAT T TACT T T T C T C TAT AC C AAAT CAC C CAT GGAAT AG T TAT T G AACACC 
TGCTTTGT GAGG C T CAAAGAAT AAAGAGGAG G TAG GAT T T T T CAC T GAT T C TAT AAG C C C AGC AT 
T AC C T GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAG GAAGGAAAAC TACAG G TC CATAT CC 
CTC AT TAACACAGACACAAAAAT T CTAAATAAAATTTTAACAAATTAAACTAAACAATAT AT TTA 
AAG AT GAT AT AT AAC T AC T C AGT GTGGTTTGT CC CAC AAAT G CAG AG T T G G T T T AAT AT T T AAAT 
ATC AACCAGTGT AAT TCAGC ACAT TAAT AAAGTAAAAAAGAAAACC ATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 105 

CCTTCACAGGACTCTTCATTGCTGGTTGGCAMGATGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
T G AC AAACT AT AT G CT G AGT T T GGC AG AG AGGCT TCT AAC AAT T T T AC AG AAAT GAGC C AG AGACT T G AAT 
CAATGGTGAAAAAT GC ATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACT GT AG AT AAAATTGTTC AAGT TGTTTT AC ATGAAAAGCT GC AAGATGCTGT AGGACCCCCT AAAGT AG 
AT CCTC ACT CAGT T AAAATT AAAAAAAT CAAC AAG AC AGAAACAGACAGCT AT CT AAACC AT TGCT GCGG A 
ACACGAAG AAGT AAAACTCT AGGT C AGAGTCT CAGGATCGTT GGTGGG AC AGAAGT AG AAG AGGGT G AATG 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTGATGC AT CCT ATGAGTTTCAACC AGGT GATGTG ATGT TTGTGAC AGGATTTGG AGCACT GAAAAAT 
GATGGT T ACAGT CAAAATC ATC TT CGAC AAGC AC AGGT GACT CT C AT AG ACGCT AC AACTT GCAATG AACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGC AAT AGTTGAAACTTT ATGT ACATAGAGAAATAGAT AAT ACAATATTAC 
AT T AC AGCCT GT AT T C ATTT GT TC TCT AG AAGTT T T GT CAG AAT T T TG ACT T GT TG AC AT AAAT TTGT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGC AAAT ATTT ATTT AACATTG 
T T AC T G AGG ATGT C AAC AT AT AAC AAT AAAAT AT AAATCACC C A 
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FIGURE 106 



MMYRPDWRARKRVCWEPWVI GLVI FI SL I VLAVC I GLTVHYVRYNQKKTYNYYSTLS FTTDKLY 
AE FGREAS NN FTEMS QRLE SMVKNAFYKS PLREE FVKS QVIKFS QQKHGVLAHMLL ICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAGSLEGKT DACQGDSGGPLVS SDARD I WYLAG 
I VSWGDECAKPNKPGVYTRVTALRDW I TSKTG I 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT G GT C AC T T T GAAAG C CAG GAT G AC AAAT TAT G GAT T AC C T AGAT AT C G G T G G C T T AC T CA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
C AGG C CC AC CAG GAAGAT C T CAG T GACC T T TAT CAAG AG GG T T G T GGGAAGAAAAT G TAT T CC T T 
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
G T T G AAAC C AAGC C T G T C AAG AAT C T T T G AAC AC AC AT C CAT GG C AAAC AGC T T T AAT AC AC AC T 
T T GAG AT G G AGGAG T T ATAAA AAGAAAT G T C AC AGAAGAAAAC C AC AAAC T T G T T T TAT T G G AC T 
T G T G AAT T T T T GAG T AC AT AC TAT G T G T T T C AGAAAT AT G TAG AAAT AAAAAT G T T G C C AT AAAA 
T AAC AC C T AAGC AT AT AC TAT T C TAT G C T T T AAAAT GAG GAT G G AAAAG T T T CAT G T CAT AAG T C 
AC C AC C T G G AC AAT AAT T G ATG C C C T T AAAAT G C T G AAG AC AG AT G T CAT AC CC AC T G T G T AGC C 
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
TCC AT G C AAAC GAG T CAC AT AT G G T GGGAC T GG AG C C AT AGT AAAGG T T GAT T T AC T T C T AC C AA 
C TAG TAT AT AAAGT AC T AAT T AAAT G C T AAC AT AG G AAG T T AGAAAAT AC T AAT AAC T T T TAT T A 
C T CAG C GAT C TAT T C T T C T GAT G C T AAAT AAAT TAT AT AT C AG AAAAC T T T C AAT AT T G G T GAC T 
ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T T CAG T T C T GAT AAT G T T AAGAAT AACCAT TAT GAAAAG GAAAAT TTGTCCTG TAT AG CAT CAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T G T TAT T TAAATAC T TAACCACT AAT T T TGAAAAT TACC AGT GT GATACATAGGAAT CAT TATT C 
AG AAT G T AG TCT GGTC T T TAGGAAG T AT TAATAAGAAAATT T G CAC AT AAC T TAG T T GAT TCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
G C T TAT CTTTGCCTTCTC C AAAC AAGAAG CAAT AG T C T C CAAGT CAAT AT AAAT T CT ACAGAAAA 
TAG TGTTCTTTTTCTC CAG AAAAAT G C T T G T GAGAAT CAT T AAAAC AT G T GACAAT T TAG AGAT T 
C T T TGT T T TAT T T CAC TGAT TAAT AT AC T GTGGCAAAT T ACACAGAT T ATTAAAT t T T T T TACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T T C T CAG AAT AT GGAAAG AAAAT T AAAAT GTGTC AAT AAAT AT T T TCTAGAGAGTAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains : 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 

CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 

GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 

CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 

CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 

CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 

TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 

CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 

TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 

CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 

CAGCTGCTCAGATACCAGCGAGT GCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 

ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGT GAACCTCAAAAACTCCTACCGCTGT 

GAGT GT GGGGTT GGC CGTGT GCT AAGAAGT G ATGGCAAGACTT GTGAAGACGTT GAAGGAT GCCACAAT AAC AA 

TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 

CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 

CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 

CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 

ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 

CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 

GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 

TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 

CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 

TCC CT GTCTT C AAGT TT GT GGGC AAAGACC AC AAGGAAGTGT TTCT GC ACT GCC GGGT TCT T GT CT GT GGAGT G 

TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 

CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACT GGGAGGACTAGTTCGTAGCCATACCTC 

GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 

TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 

CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 

CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGT GCAATCAGAC 

CACAAAATCAGAAGCTGGGTATAATATTTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACTGAAATTATGA 

CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 

T T AT AGGGAAT TT GGAAGT GT ATCAAT AAAACAGT AT AT AAT TT T 
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FIGURE no 



MPPFLLLTCLFITGTSVS PVALDPCSAY ISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYEILTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTC FDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEI FKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV 
SGLESLVESCFATPTSKI DEVLKYYLIRDGCVSDDSVKQYTSRDHLAKH FQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N- gly cosy 1 at ion si tes . 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy la tion site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 

GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GGAG ACCTTCAGAAAGGTGGGGATCCCCAT CAT CAT AGC ACT ACTG AGCC TGGCGAGTAT CATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GC T AC AGC AGAGCTGT GGAG AT T GGCCC AG AC C AGG AT CTGGAT GT TGTTG AAAT C AC AG AAAAC AGCC AG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 
ACCAGGGGGAAG T CAC CG AGAAGAT G AT GTGT GC AG GC ATCC CGG AAGGGGG T GT G GAC ACCTGCC AGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHC FRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIII IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATG CTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
G AAC G CC G G AGG AAAG C AAAG T G GCAGGGAAG GAAC T T GT G CC AAAT T AT GG G T C AG AAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCT GAGAGGCC C T C C TAT G T CC C TAG T AAAG C C AC C AGC AAGAC AT AG C T G AC AGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCA(3CAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
C AAAT G AT CT CC AAGG GC C C T T AT AC CCCAG GAGAC T T T GAT T T GAAT T T GAAAC C C C AAAT C CA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C C AG C T AC T C G G G AGG C T GAG AC AG G AGAAT T AC T T GAAC C T G G GAG G T G AAG GAG G C T GAGAC A 
G G AG AAT C AC T T C AGC C TG AGC AAC ACAG CG AG AC T C T G T C T C AGAAAAAATAAAAAAAG AAT T A 
TGGTT AT TTGTAA 



App_ID=l 00635 14 



Page 265 of 320 



...fc. n,..i J JLj* „,..}• „,n„ .'5 nJi \> n.„>' ,„iu, £.1 



FIGURE 114. 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE U5 



C AG CAG T GGT C T C T CAGT CC T C T C AAAGC AAGG AAAGAG T AC T G T G T G C T GAG AG ACCATGGC AA 
AGAAT CCT C CAGAGAAT T GT GAAGAC TGTCACAT TCT AAAT GCAGAAGC T T T TAAAT C CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCT ACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAAT T GATCC T GT GAC CAGAAC TGAAAT A 
TT CAGAAGCGGAAATGGCACT GAT GAAACAT TGGAAGTGCACGAC T T TAAAAACGGAT ACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AG AAG AG GAAAT AGAT G AGAAT GAAG AAAT T AC C AC AAC T T T C T T T GAAC AG T CAG T G 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
AC T T T GAG GAGG AG G GAG AAG AT C T T CAC TTTCCTGC C AAC G AAAAAAAAGG G AT T G AAC AAAAT 
GAAC AG TGGGTGGTCCCT C AAG T G AAAG TAG AG AAG AC C C G T C AC G C CAG AC AAG C AAG T GAG G A 
AG AAC T T CC AAT AAAT GAC TAT AC TG AAAAT GGAAT AG AAT T T GAT C C CAT G C T G GAT G AGAG AG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T T ATC AC CAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites . 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-rnyristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 

GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 

GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 

CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 

TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 

GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 

CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 

TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 

GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 

TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 

GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 

TGGGTCC AACAC CAAAAACAAG AAGAT AT ACGATGG AGGT GC CCGC AC AG AGGACG AGGT AC AAT CTT AT C 

CTTCCAAGCACGACTATGTG TAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 

CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 

CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 

TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 

TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 

GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 

ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 

CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 

AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 

GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 

ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 

TCGAG ACT AGCCT G GGC AAC AT GG AG AAGCCC T GTCTCT AC AAAAT AC AG AG AGAAAAAAT C AGCC AGTC A 

TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 

TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 

AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLP^LQAVRAmiVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AG VS V FANML VTN FWMS TANM Y T GMGGMVQT VQ TR Y T FGAAL FVG W VAG G L T L I G G VMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT CAGGAT GC AGTG C AAAAT C TAT GAT TCCCTGCT GGC T CT TT C T CC G GACC T ACAGG CAG C CAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACC AG GT G C AC G GGG G ACAAT GAG AAG GT GAAGGC T CAC AT T C TG C T GACGG C T GG AAT CAT C TT CAT 
C^TCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGAT AC T C GAT ACC T T C C CAT CG C AC AACC CAAAAAAG T TAT C ACAC C GGAAAG AAGT CAC C GAGC G 
TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CT TAAT TAC AGG AAC T GT GC AT C AGC T AT T TAT GAT T C TAT AAG C T AT T T CAG CAG AAT GAG AT A 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CT C TTTTT ATCATTTAC T T C AAAAT GACATT GCTAAAGAC TGCAT TATT T TACTACT GT AATTTCTC C 
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 
T T T AT TT AAAAT GAAAT G C C AGT C CAT T ACAC T GAAT AAATAGAAC T C AAC TAT T G C T T T T CAGG GAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
T T TATAAT GAAGAT T AAAAT G AAG GCTT TAAT CAG C ATT G T AAAG GAAAT T GAAT GGCT TTCT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
T T CCAGG G C TAT ACT C AGAAG AAAG AT AAAAGT GT G ATC T AAG AAAAAG T GAT GGT T TT AG GAAAGT G 
AAAAT ATT T TTGTTT TTGTAT T T GAAGAAGAAT GAT GCAT TT T G AC AAG AAAT CAT AT AT G TAT GGAT 
ATATTTTAATAAGTAT T T GAGTACAGACT TT GAGG T T T CAT GAAT AT AAAT AAAAG AGC AGAAAAAT A 
T GTCT TGGT TTT CATT T GCTT ACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TAT AT TAAT AAAT T GT ACAT T T T T C TAAT T 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDSL1ALSPDLQAARGLMCAASVMSFIJIFMMAILGMKCTRCTGDNEKVKAHILLTAGI I FI I TG 
MWL I PVS WVANAI I RDFYNS I VNVAQKRELGEAL YLGWTTALVL I VGGAL FCCVFCCNEKS S S Y 
RYS I PSHRTTQKS YHTGKKS PSVYSRSQYV 

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAATGT CTGAGGGAAAGCT T TGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT G AAG C T AT AAT T TAT T T G G AC C AAGG AAG C C C T G AAAT G AAT T C AAC AAT T AAT AT T CAT C G 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GG CAC T T G TTCAGAT TAC CCAAAAGGAGAT GCT TCTAC T G GAT GGAAT T CAG T T T CTCGCATC 
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
T T GGAAT GGT T CAC T TAAAT G AC AT T T T AAAT AAG T T TAT GT AT AC AT C T GAAT G AAAAG CAAAG 
C TAAAT AT GT T T ACAGAC CAAAG T G T GAT T TCAC ACT G T T T T T AAATC TAGCAT TAT T CAT T T T G 
CTTCAATCAi^AAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
TAAAT AAAAAT TAT T T CC AAC A 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPS SASE I PKGKQKAQLRQRE WDL YNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T G C C T G GAAG AAT ACAT CAT GTTT T T C GAT AAGAAG AAAT T GT AGGAT CCAGT TTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GC CC C T AAG CC AAAGC AAAAG ACC T AAGG AC GAC C T T T G AAC AAT AC AAAGG ATGG G T T T C AAT G 
T AAT TAG G C T AC TGAGCGGATCAGC TGTAGCACTGGT TATAGCCCCCACTGTCT TACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
C T T GACCAT AAC CAT AT C AG CAAT AT T GAC GAAAAT G C T T T T AAT G GAAT AC G C AG AC T C AAAG A 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTG G CAT GAT CAG AC T CAAAGAAC T T C AC C T G GAG CACAAT CAAT T T T CC AAG C T CAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGT C TAAGGGAGAAT ACAAT TAT C TGT GCCAGT CCCAAAGAGC T GCAAGGAGT AAATGT G 
ATCGATGCAGTGAAGAAC TACAGCAT CT G TGGCAAAAG T AC TACAGAGAGGT TT GATCTGGCC AG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
G G C AC AG GAAAAAG AAAAG AC AG T CC C T AAAG C AAAT GAC T CC CAG CAC C CAG GAAT T T TAT G T A 
GAT T ATAAACCC ACC AAC AC GGAGAC CAG CGAGAT G C T G C T GAAT G GG AC GG GAC C C T G C AC C T A 
TAACAAATCGGGCTCCAGGGAGTGTGAGGT ATGAA CCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
G G T CAT TTTCCTCT CAT ACAT AAT CAAC C C AT T GAAAT T T AAAT AC CACAAT CAAT G T GAAG C T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
S GNE I EAFS G PS VFQC VPNLQRLNLDSNKLT F I GQE I LDS W I S LND I S LAGN I WECSRNI CS LVN 
WLKSFKGLR£NTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 , 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12s 



CCGTTATCGTCTTGCGCTACTGCT GAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTT T GCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 

AT T GAAGAG GAAGG C T T T C T AAAG C T T T GGC AAGGAGT GAC AC CCG CC AT T T AC AG AC AC GT AGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AGC CAAT C C AAC T GAC C T AG T G AAG G T T C AG AT GC AAAT GG AAGGAAAAAG G AAAC T G GAAGG 
AAAAC CAT T G C GAT TTCGTGGTG T AC AT CAT GC AT T T G C AAAAAT C T TAG C T G AAG GAG G AAT AC 
GAGGGCT T T GGGCAGGCTGGGT AC CCAATATACAAAGAG CAGCACTGG TGAATAT G GGAGATT TA 
ACC AC T TAT GAT AC AGT G AAAC AC T AC T TGG T AT T G AAT ACAC CAC T T GAGG AC AAT ATC AT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAAT CAAC CAC GAGAT AAACAAGGAAGGGGAC TT TTG TAT AAAT CATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGV I GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAK I LAEGG I RGLWAGW VP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLL YKS S TDCL I QAVQGE G FMSL YKGFLP S WLRMT PWSMVFWLT YEKI REMSGVS P F 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAG AGGAC AT CC T C TCCAT GT TC T CAGC TAT CCGT TC T CAGC AC AGCGG T G T AG ACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CC AG AT T GGAG AC AT C C AG AT G AGGC C C AC GG AG CAG G T G ACC TAGT GAC T G T G G GAG C T C C 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
C AGAACACC AGGGCCT GGCCCAG T G GAT T TCAT GG T GAT CAT T AAAAAAG AAAAAT CGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKWGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACT T C T AC ATG GG CC T CCT G CTGCT GG'i'GC T C T TCCTC AGCC TCC TGCCGG TGGC CT ACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
C CGT T GAG T GC T TG AAATGAG GAAC T G AGAAAAT T AAT T T C T C AT GT AT T T T T C T CAT T TAT T T A 
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA 

TAT AT AAT GAT C AAAT CAGGG T AAC T G GG AT AT C CAT C AC AT C AAAC AT T TAT T T T T TAT TC T T T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 

GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGH G T L GE S PMP FKRVFC QDGNVRS FC VCAVH FS S H Q P P VAVE C L K 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

ammo acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lai 

TT CTGAAGT AACGGAAGCTACCTT GT AT AAAGACCTCAACACTGCTGACCA^GATC AGCGCAGCCTGG AGC 

ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 

TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 

GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 

T T GAAAAACT T GCT G AAAGT AG AAAG AATAT ACC T ATACC AC AAC AGT TT AG AT G AAT TTCCTACC AACCT 

CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 

AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 

TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 

GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 

GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 

TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 

CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 

GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCGTGGTATTGCGGGTGCAAGATGAAATG 

GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 

AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 

ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 

CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 

CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 

ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 

CG AATGTACAAC CCT ACAAC CACCCT C AAT CGAG AGC AAG AG AAAG AACCT T AC AAAAACCCCAATT T ACC 

TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 

TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 

GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 

AAGC AATG AACC CAT CTCG AAGGAGG AGT T TGT AAT AC AC ACCAT ATTTCCT CC T AATGG AAT G AAT C TGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 

GT 
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FIGURE 132 

MI SAAWS I FLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFI YCNDRFLTS I PTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILE IRETS FQMLPI SNEPI SKEEFVIHT I FPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
ammo acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE iaa 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTC ATG TCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTA/^AGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C T C CAC T AC C T C AAGC T GAG T GAT CC CAAG T AC C TAAG AG AG T T C C AG C T GAC C C T CC AGCCC GG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GAC CAC T C C AAC AG AG C C AG GAC T AT AT C AAC CTCTTCTGCGC CAAC AT G ATG G AC T T G AAC C GC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGAT GC T TCGGGGAGCCT GAT GC TGAAGAT GAAGAAT TATC TAAAGC TAT TCAA 
TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
G C T G AAAAG CAC T T T C AAAG AG AC T G T G T T G AAT AAAG G G C C AAGG T T C T T G C CAC CC AG CAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCC TAT AC AT C C T G G C AG AAT AC C C C C C AG C AAAC AG AG AGC CAC AC C CAT C CAC AC C G 
CCACCAC CAAGCAGCCGC TGAGAC GGACG GT T CCATGCCAGCT GCC TGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
G G C CGGGAGG GACC CAGG T G T GAACGG AT GAAT AAAG T T CAAC T GC AAC TGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSAiERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLRE FQLTLQPGEWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y I NL FCANMMDLNRRAEAI GYAYPTRD I FMEN IM FCGMGG FS DFYKLRWLE AI LS WQKQQEGC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

ammo acids 10-16, 20-26, 63-69, 208-214 

Ami da t ion site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 



App_ID=10063514 



Page 286 of 320 




.il.. ^ >n «„.tt u„„i If. u ii™ , 



FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T AC AGAT TTGATCCCG T T CGAGT GGATATGACT T C GAAAG GAAAAAT GAGAG CAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TAT GAT GAT GGTTCTTCCTT TAT T GAT AT TTGTGCTTCTG C CT AAAGT GG T C AACAC AAG T GATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACAC T GGG T G GC AT CC AAG T C T T GGAAAACC G T G T GAAGC AACT AC T ATAAAC T T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAAT AT TGATGTCACTGAATT AAT TACAGTGT CC T AT AG AAAAT G C CAT TAATAAAT TAT 
AT GAACT AC T ATACAT TATGT ATAT T AAT TAAAAC AT C T T AAT CC AGAAAT C AAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE i!*6 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKT DGS FWHD IPSGSYWEWS PAYRFDPVRVDI TS KGKMRARYVNY I KT SE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFMNPMVMiyiMVLPLLI FVLLPKWNTSDPDMRREME 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acxds 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Ami da t ion site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 1*7 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTAT ATGC GTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CAC CAACTC GCAC T CAGAC T CT GAAC TCAGACC T GAAAT CT TCT CT TCACGGGAGGCT T GGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAG C T G T G T G ATC GC C AC AAAC C T TC AGG AAAT AC GAAATG G AT T T T C T GAG AT AC G GG G CAG T G 
TGCAAGCCAAAGATGGAAACAT TGACATCAGAATCT TAAGGAGGACTGAGTCT TTGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAAT ACAGCCAGAT T C T GAG T CACT T TGAAAAGC T GGAAC C TCAGGCAGCAGT TGT GAAGGC 
T TT G GGGGAAC T AGACAT T C T T C T GC AAT GGAT GG AG GAGAC AG A ATAG G AG G AAAG T GAT G C T G 
C T G C T AAGAAT A T T C GAG G T CAAGAGC T C C AGT C T T C AAT AC C T GC AG AG GAG G CAT G AC C C C AA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTG CAT GAT T G T C T T TAT G CAT C C C C AAT C T T AAT T GAGAC CAT AC T T G TAT AAG AT T T T 
T G T AAT AT CTTTCTGC T A T T G GAT AT AT T T AT TAG T T AAT AT AT T TAT T TAT T T T T T GC TAT T T A 
AT G TAT T TAT T T T T T TAC T TGGACAT GAAAC T T TAAAAAAAT T CAC AG AT TAT AT T TATAACCTG 
AC T AGAGCAGG TGATG TAT T T T TATACAG TAAAAAAAAAAAAC C T T G TAAAT TCTAGAAGAGTGG 
C T AG GG G G G T TAT T CAT T T G TAT T CAAC T AAGG AC AT AT T TAC T C A T G C TGAT G C T C T G T GAG AT 
AT T T GAAAT TGAACCAATGACT ACT TAGGAT GGGTTGT GGAAT AAG T T T TGATGTGGAAT T GCAC 
ATC TACCT T ACAAT T AC TGACCAT CC CCAGTAGACTC CCCAG TCCCATAAT T G T GTAT C T T CCAG 
CCAGG AATC CTACACG GCCAGCAT GT AT T TC T ACAAATAAAG TT TT C T T TGCAT AC CAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

ammo acids 1-42 

cAMP- and cC34P-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-rnyristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 

CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTC T TT T ATGGAGATGAAACCTGGGT TAAATTA 
T T C C C AAAG CAT T T T G T G G AAT AT G AT GG AACAAC C T CAT TTTTCGTGT C AGAT T AC AC AG AG G T 
G G AT AAT AAT G T C AC G AG GC AT T T G G AT AAAG TAT T AAAAAG AG GAGAT T GGG AC AT AT T AATC C 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGTGCT GATGAAGATCCACACCT CAC TGCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGT GAT AT C C GAC AT CC AAAG CAC G T CC AATAGAC G GAT GTGGCTGC GAC AC T GG C GAT AGC 
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
C AAT GAG AG AGC AG T T GAGAT T T T T AC AT T T G AAT AC AG T G C AG C T TAG T AAAC T G T T G C AAG AG 
AAT G T G C C G T CAT AT G AAAAAG AT CCTGGGTTT GAG C AG T T T AAAAT G T C AG AAAG AT T G CAT G G 
G AAC T G GAT C AG AC T G T AC T T G G AGG AAAAG CAT T C AG AAG T C C T AT T CAAC CTGGGCTC C AAG G 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
C ACT CAGGAG GG TCAAAAG GAGAC T T GGTCGCACCAC T CAT CCTGC CACCCC CAG AATG C AT C CT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T CAG AT C CAC AG AG CC CAG GAT CAAGG GAC C CAC T G C AGT GGC AGC AGGAC TGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T G CAC AC AG T A T G T AG T T AC C AAAAG AAT AAAC G G C AAT AAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
T CCC TAT AG AAAACAACTGCCAGCACCT TAAGACCAC T CACACC TTCAGAGTGAAGAACT TAAAC 
C CG AAG AAAT T C AG CA T T CAT G AC C AGGAT C AC AAAG T AC TGGTCCTG GAC T C T G G G AAT C T CAT 
AG C AG T T C C AGAT AAAAAC T AC AT AC GC C CAGAGAT C T T C T T T G CAT T AG CC T CAT C C T T GAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T G T GAC AAG GAT AAAG GAC AAAG T C AT C CAT C C C T T C AG C T G AAG AAG G AG AAAC T GAT G AAG C T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGT GAC AG AT AAAT T T G AG AAC AG GAAAC ACAT T G AAT T T T CAT T T C AAC C AGT T T GC AA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 



MLLLLLE YNFP I ENNCQHLKT TH T FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNY I RPE I 
FFALAS S LS S AS AEKGS P I L LGVSKGE FCL YCDKDKGQSH PS LQLKKEKLMKLAAQKE SARRP F I 
FYPAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



C T AGAG AG T AT AGG GC AGAAGGAT GG C AG AT GAGT GAC T C C AC AT C C AGAGC TGCCTCCCTT T AA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACC ATGC TGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
G G G G AG AAC TAG G T GT GG GAAAT T T G AAG AC GACAT T G AC AAC T GC CAT T T C CAAGAAAGC AC AG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
AC AC AT C T T G AGC C T AAT CAT G T AG T G T AGAT C AT T AAAC AT C AGC AT T T T AAG AAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 14-1 

MLGLPWKGGLSWMLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGH I LNS WKEQVESKTVFSMELLLGRTRCGKFE DD I DNCH FQE S TELNNT FTC FFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

ammo acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



C T G T G C AG C T C GAG GC I' C C AG AG G CAC AC T C C AGAG AG AG C CAAGG T T C T GACG C G ATGAG GAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAG C T T G CC AGGAGGC T G G C AG T ACAGAGCG CAGC AG CGAG CAAAT C C T GGC AAG T G ACC CAGC T 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAAC ACATCAGGC ACTGCG CC ACC TGC T TCACAGT ACT T CC CAACAACT C T T AGAGGTAG 
GTGTATT CCCGTTT TACAGATAAGGAAAC TGAGGCCCAGAGAGCTGAAGTAC TGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
G T C C AAT T C C AG AG CTGTCTGGT GAT CAC T T TAT G T C T CAC AG G GACC C ACA T C C AAAC AT G TAT 
C T C T AAT G AAAT T G T G AAAG C T C C AT G T T TAG AAA T AAAT G AAAAC AC C T G A 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRPCALPSTAQITEAQVAENRPGAFIKQGRK 
LD I D FGAEGNRY YEANYWQFPDG I HYNGC S EANVTKEAFVTGC I NATQAANQGE FQKPDNKLHQQ 
VL WRLVQELC S LKHCE FW LE RG AG LRVTMHQ P VLLC L LAL I W LMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCrTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
C AAC T T T CAG AT T C AG GG G G T ACAT G T GAAG GTTTGTTT TAT GAG TAT AT TG C AT GA T G C T GAGG 
TTTGGGGT 



AppJD= 100635 14 



Page 299 of 320 



:!L0'0-:iS3E;:i;. i ivI- 



FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-rayristoylation site. 

amino acids 62-68 
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FIGURE 1AQ 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATT AAGTGCAGAAAGAC CT TT GAATGAACAGATTGCT GAA 
G C AGAAG AAGAC AAG AT TAAAAAAAC AT AT C CTCC AGAAAACAAG CC AGGT C AGAG C AAC TAT T CT T T 
TGT T GATAACTT GAAC C T GC T AAAGGCAATAACAGAAAAGGAAAAAAT T GAGAAAGAAAGACAATCTA 
T AAG AAGC T CCC C ACT T GAT AAT AAGT T GAAT GTGGAAG AT GT T G ATTC AACCAAG AAT C GAAAACTG 
ATC GATG ATT AT GACT C TACT AAGAGT GGAT T GGAT CATAAAT T T CAAGATGATCCAGATGGTCT TCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATCTCAAAGGAAGCCAACAA 
T TAT GAG GAGG AT CCC AAT AAGC CC AC AAGC T G GAC T GAGAAT C AGGC T GGAAAAAT AC C AGAGAAAG 
T GACTCCAATGGCAGCAATTCAAGAT GGT CTTGCTAAGGGAGAAAACGAT GAAACAGTAT CTAACAC A 
T TAACCTT GACAAATGGCT T GGAAAGGAGAACTAAAACCT ACAGT GAAGACAACTTTGAGGAACTCCA 
AT AT T T C C C AAAT T T C T AT G C GC T AC T G AAAAG TAT T GAT T GAG AAAAAGAAG C AAAAG AG AAAG AAA 
CAC T GAT TACT AT CAT G AAAACACT GAT TGACT TTGT GAAGAT GAT GGT GAAAT AT G GAACAATATC T 
CCAGAAG AAGG T GT TT C CT AC CT T GAAAACT T G GAT GAAAT GAT T GC T C T T C AGACC AAAAAC AAGC T 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
C AG AC AG T ACC AAGG AAGAAG C AG C T AAG AT GG AAAAGG AAT AT G GAAG CT T G AAG GAT T C CAC AAAA 
GAT GAT AAC TC C AAC C C AGG AGGAAAG AC AGAT GAAC C C AAAG GAAAAAC AG AAGC C TAT T TGGAAG C 
CAT CAGAAAAAATATTGAATGGT TGAAGAAACATGACAAAAAG GGAAAT AAAGAAGATT ATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
G AAGC CG AGGC CAT CAAGC G C AT T TAT AG C AGC CT G TAAAAAT GG C AAAAGAT C C AG GAGT CT T T CAA 
CT GT T TCAGAAAACATAATAT AGCTTAAAACACTT CTAATTCTGT GAT TAAAATT TTTTGACC CAAG G 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
G T AAAAAC TAT C T GAAAGT AAAGT T GT AT GT AAGC T GAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEAWNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTL IDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEK5HEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 

CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 

G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 

ATAAT AACCAGC T T CTAGCT GGAGGGC T GCATGCAGGGAAGGT CATTAAAGG TGAAGAGATCAGC 

GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 

CCAG T GC C T G T CAT G TGGGGTGGGG C AGGAGCC GAC T C T AACAC T AGAGC CAG T G AAC AT CAT GG 

AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 

TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 

TGTCAGACTCACCCAGCTTCCCGAG7^ATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 

AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 

GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 

TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 

CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 

CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 

T AAT G G T AAC T GAC AAG T G T T ACC CT GAG C C CC G CAG G C C AAC CC AT C CC CAG T T G AGCC T TATA 

GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 

GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 

TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 

GAAAT G G C T CGAGC T C AGAAG AT AAAAGAT AAG T AGGGT AT G C T GAT C C T C T T T TAAAAACCCAA 

GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 

AG ACC T AC TTACAAAGT GGC AT AT ATTGC AAT TT AT TTT AAT TAAAAGATACCT AT TT AT ATATT 

TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 

AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 

ATTCCTTGTATAAAAATAAGAAAAGAAAT TAATC T TGAGGT AAGCAGAGCAGACAT CATCTCTGA 

TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 

TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 

G TGGC T G GAAT C T C T GGG T AAG G AAC T T AAAGAAC AAAAAT CATC TGG T AAT TC T T T C C T AGAAG 

G AT CACAGC CC C T GG GAT T C CAAGGC AT T GGAT C C AGT C T C T AAGAAG GC T G C T G T AC T GG T T GA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG - 

GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 

GACTGG T TTCC T T G TATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATG TA 

AAG AT GAAG GCAGAGA T CG G AG T T T T G CAG CC AC AAG C T AAG AAAC AC C AAG GAT T G T G G C AAC C 

AT CAG AAG C T T G GAAG AG G C AAAG AAGAAT T C T T C C C TAG AG G C T T T AGAGG G AT AAC G G C T C T G 

CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 

GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 

CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 

TAG AAT AT G GC AGAAG TG AT G G C AT GC C AC T T C C AAG AT TAG G T T AT AAAAG AC AC T G C AGC T T C 

TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 

AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 

GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 

AAG T T G C T CAG TTTTGGTC T AAC T T G T TAT G C AG C AAT AG AT AAAT AAT ATG CAG AG AAAG AG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

ammo acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 



AppJD= 100635 14 



Page 304 of 320 



:l,0 O-ifc 3€ j. -4 „ 'O :S0i :l, OS: 



FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGC'X'CGAGTTAGAATTGTCi'GCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T T GG AG AGAG T G G AG AGAT C AAAG CAAT T GG AGAAC T G GAT TTGCTGTT TAT G T C T C T G AG AAAT 
GCCTGCAT T TGACCAGAGCAAAGC T GAAAAATGAATAAC TAACCCCCTT TCCC T GC TAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AG AC C AG AAG G T AGAC T T T C T AAG CAT AG AT AT T TAT T GAT AAC AT T T CAT T G T AAC TGGTGTTC 
TAT ACACAGAAAACAATT TATTTT T TAAATAATTG TCTT TT TCCATAAAAAAGATTACTTTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T G TAT T T AT TAT TAT TAT AAG AC T G CAT T T TAT T TAT AT CAT T T TAT TAATATGGAT T TAT T TAT 
AGAAACAT CAT T CGAT AT T GC T AC T T GAG T G T AAG G CT AAT AT T GAT AT T T AT G AC AAT AAT TAT 
AGAG C T AT AACAT G T T TAT TTGAC C T CAAT AAACAC T TGGAT AT CCC 
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FIGURE 1K4 

MAALQKSVSSFLMGTIJVTSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFKLT^KEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLT^RLSNRLS 
TCH I E GDDLH I QRNVQKLKDT VKKLGE S GE I KA I GE L DL L FMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 

C CC CC TGGAGAT AT GAGT TGGACAGAGAC T T GAACCGGC TCCC CCAGGACC T GT AC CACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGAC AT GGAT GAT T C T GAG G AGG AAGC T G T TAT T GAAT G T AT AGAGAT T TATCCAAAT AAAT AT 
C T T T AT T T AAAAAT GAAAAA 



App_ID=10063514 



Page 307 of 320 



1/ i^...„p ri.^ji, JJ,,'l «... 



FIGURE is6 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACC TCCGAGTAGAACC TGT TACAACTAGTGTTGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 

GAT T T G T G T GACGGGC AAAAGC AAC T TCC AG T C CT AC AGCT G TGT G AGGT GC AAT T AC ACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 

G AGC T G AAC AC AGT C TAT T T CAT T G G G GC CC AT AAT AT TCC TAATG C AAAT AT G AAT G AAG AT GG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
G T AG AAGT G AAC T T C AC AAC CAC TCCCCTGG G AAAC AG AT AC AT G G C T C T TAT C C AAC AC AG CAC 
TAT CAT CGGGTTTTCT C AG G T G T T T G AGC C AC AC C AG AAG AAAC AAAC G C GAG C T T CAG T G GT GA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGAT CT AT CTAATGTGGAGGCACGAAAGGATCAAGAAGACT T CCT TT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

T CACACAAT T T G T T AC T T CACT GAAT T TC T TCAAAACCAT T G C AGAAG TG AG G T CATC C T T GAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G T GAT C T AAG AAG C CAG AT T CAT C T G CAC AAAT AC GTGGTGGTC T AC T T TAG AG AG AT T G AT ACA 
AAAGACGAT TACAAT GC T CT CAG T GT C TGCCCCAAG T AC CACCTCATGAAGGAT GC CACT GCT T T 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 

ATGGCTGCTGCTCCTTG TAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKEOCCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Ami da t ion site. 

amino acids 488-452 
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FIGURE isq 

AGCCACCAGC(jCAA.CATGACAGTGAA.GACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGTIAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAA GAGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMS RN I E S R S T S P WN Y T VT W DPNR Y P S E WQAQCRNLGC I NAQGKE D I SMNS VP I QQE T L W 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 

AC AC T GG CC AAAC AAAAACG AAAGC AC TCCGTGCTG G AAG TAG GAG GAGAGTCAG GAC TCCCRGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATG CCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGC C T GAAGAT GAG GAAAAG T T TGGAG GAGC AG CT GAC T GAG G G G T GG AG GAG C C TAG G AA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCT GCAG T CAGGC CAG T G T CT GCAGC TAT GGGAC GATGACT T GGGAGCGCT AT GGGCC TG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAA TAAAGGCAGACGCTGT T TTTC TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRJ^ASLOAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVXEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAAiSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL YSADDSG FERL VGALASALCQLPLRVAVDLWSRREL SAQG P VAW FHAQ RRQTLQEGGVWLL FS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 

GGGAGGGCTCTGTGCCAGCCCCG ATGA .GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACAT C C T GAC GTGG GACAGC G G G C CAG AGGG C AC C C C AGAC AC G G T CT AC AG CAT CGAG TATA 

AG ACG T AC G GAG AGAGGGAC TG GG T G GCAAAGAAGGG C T G T CAGCGGA T CAC C C G GAAG T CC TG C 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TC TAC CAC T TAGAGC TC CAGG TC AAC CG CACC TAC C AAAT G CAC C T T GGAG G GAAG C AGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

C C T G GG C CAAGG AG AG T G CC CC C TAC AT G TG CC GAG T GAAG AC AC T GCCAGAC C G GAC AT GGAC C 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

C C C T GG GG AT T T GC AC AGAC AGAACAT C T GAC C C AAAT G T GC T ACACAGT GG G GAG GAAGG GACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGT GC AAGGAGGAAAT GCAGGGAAAC TCCCGAGGT CCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

T T C C C AGC C AGGGC AAC T GC C T G AC GT T G C ACG AT T T CAG C T T CAT T C CT C T G AT AG AAC AAAGC 

GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 

ATCC T GAGAAT GGGGTT TGAAAGGAAG GT GAGGGCT GT GGC CCCT G GACGGG T ACAAT AACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRI TRKS CNL TVE T GNLTEL Y YARVTAVS AGGRSATKMTDR FS S LQHT TLKPPDVTC I S 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16s 



T GGCC TACT GGAAAAAAAAAAAAAAAAAAAAAAAG TCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAG GT GAC GAT GT AT AT G T C T AC T AC AAAC T G GC AG G GG GAT CC C T T GAAC T T T G GGC TG GA 
AG T G T T G AACAC AG T T T T G GAT AT T T T CC AAAAGAT T T GAT CAAGG T AC T T C AT AAAT AC ACG GA 
AG AAG AGC T AC ATAT T CC AGCAG ATG AGACAGAC TTTGTCTGCTT TGAAG GAGG AAG AGATGAT T 
T T AAT AGT T AT AAT G T AG AAGAG C T T T T AGG AT C T T T GG AAC T GGAGGAC T C T G T AC C TGAAGAG 
T CGAAGAAAG C T GAAGAAG T T T C T CAGCACAGAGAGAAAT C T C C T GAG GAG TCTCGGGGGCGT GA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATG C T TACAAAG T C CT GAA?VACAGAAATGAGTC AGAGAGGAAGT GGAC AGTG CGT TAT T 
CAT TAC AGC AAAGGAT T TCGT T GGCATC AAAAT C T AAG T T T G T T T T AC AAAGAT TGT T T T TAGT A 
C TAAGC T GC CT T GGCAG T T T GCAT T T T TGAGCCAAACAAAAAT AT AT TAT TTTCCCTTC T AAGTA 
AAAAAAAAAAA&AAAAAAAA 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of* the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 

GC C T T GAGAAT G AAGCAG AACAG AAG T T AAT AGAGAG CAT G T T G CAAAAC C T G ACAAAAC C C G GG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 

T G C C T G C C C AGAT C T C T AC C AGT GG T C T GAT GG AAG C AAT T C C CAG T ACC GAAAC T G G T AC AC AG 
ATGAACCTTCCTGCGGAAGTGAA7WVGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 

CTTGGGGGTCCC T ACC T T T ACC AG T G GAATGATGAC AG GT G T AACAT G AAGC AC AAT TAT AT T T G 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAG TAAAGGAAGAACAAAAACTAGT CCAAAC CAG T CT ACACTGTGGAT T T CAAAGAG TA 
CCAG AAAAG AAAGT GG CAT G G AAG T ATAAT AAC T CAT T G AC T T G GT T C CAGAAT T T T GT AAT T C T 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAG AT G AAC T G T AAGC TCCCCCTT G AG GC AAAT AT T AAAG T AAT T T T TAT AT G T C TAT TAT T T CA 
T T T AAAGAATAT GC TGT GC T AAT AAT GGAGT GAGACATGC T TAT T T T GC TAAAGGAT GCACCCAA 
AC T TCAAAC T T CAAGCAAAT GAAAT GGACAAT GCAGAT AAAGT T GT TATCAACACGTCGGGAG TA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
T G TAT AT T G TAT T GAAAT T T AC AG T G T GC AAAAG TAT T T T ACC T T T GC AT AAG T G T T T G AT AAAA 
ATGAAC TGT TC TAATAT T TAT T T T TATGGCATC T CAT T T T T CAATACAT G C T C T T T TGAT TAAAG 
AAACT TAT T AC T GT T GT C AAC T GAAT T CAC ACACACACAAAT ATAGT ACC ATAGAAAAAGT T T GT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT AAG AAG C T AT T T CAT T AAGT G T GAT AT AAAC C T C C T CAAACAT T T TAC T T AGAG G C AAGG AT 
T G T C T AAT T TCAAT T G T G C AAG AC AT G T G CC T TAT AAT TAT T T T T AGC T T AAAAT T AAAC AGAT T 
T T G T AAT AAT G T AAC T T T G T T AAT AG G T G CAT AAAC AC T AAT G CAG T C AAT T T G AAC AAAAG AAG 
T G ACAT AC ACAAT AT AAATC AT AT G T C T T CACACG T T GC C T AT AT AATGAGAAG CAGC T C T C T GA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATT GACACT GGAGGCAGATAG T T GCAAAG T TAGTC TAAGG T T T CCC TAGC TG TAT T TAG CCT C T G 
AC T AT AT TAG TAT AC AAAG AG G T CAT G T G G T T GAG AC CAG G T GAAT AG T CAC TAT CAG T G T G GAG 
AC AAG CAC AG CAC AC AGAC AT T T T AG G AAGG AAAG G AAC TAC GAAAT C G T G T G AAAAT G G G T T G G 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAG 1 1 PNL I YWI PTI PLLLL I LVAFGTCCFQMLHKSKGRTKTS PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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